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NUCLEAR MAGNETIC RESONANCE ASSEMBLY OF CHEMICAL ENTITIES 
BACKGROUND OF THE INVENTION 



The present invention relates generally to drug 
discovery methods and, more specifically to Nuclear 
5 Magnetic Resonance (NMR) methods for identifying 
compounds that interact with macromolecules . 



Two general approaches have traditionally been 
used for drug discovery: structure-based drug design and 
screening for lead compounds. Structure -based drug 

10 design utilizes a three-dimensional structure model of a 
drug target to predict or simulate interactions with 
known or hypothetical compounds. Alternatively, in cases 
where a three-dimensional structure model of a drug 
target complexed with a ligand is available, therapeutic 

15 drugs can be designed to mimic the structural properties 
of the ligand, thereby identifying lead compounds for 
further development. 



Screening for lead compounds is' another 
approach that has been used with some success to identify 

20 lead compounds for therapeutic targets. Screening 

involves assaying a library of candidate compounds to 
identify lead compounds that interact -with a drug target. 
The probability of identifying a lead compound can be 
increased by providing increased numbers and variety of 

25 candidate compounds in the library to be screened. 

Synthetic methods are available for creating libraries of 
compounds and include, for example, combinatorial 
chemistry approaches in which selected chemical groups 
are variously combined to generate a library of candidate 
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compounds having diverse combinations of the selected 
chemical groups. In addition, advances have been made to 
increase the through-put for a number of screening 
methods. However, for many drug targets the throughput 
5 of available screens is prohibitively low. Furthermore, 
even in cases where high throughput detection is 
available, limitations on available resources for 
obtaining a library with sufficient size or diversity, or 
for obtaining a sufficient quantity of the drug target to 
10 support a large screen, can be prohibitive. 

The efficiency of library screening approaches 
can be increased by combining structure-based drug design 
with the methodologies currently available for library 
screening. In particular, the probability of identifying 

15 a lead compound in a screening approach can be increased 
by using focused libraries containing member compounds 
having a higher probability of interacting with the drug 
target. Focused libraries having members with a limited 
range of structural or functional variations have been 

20 obtained based on variations predicted from structure- 
based drug design methods and used to screen for 
candidate drugs. 

However, for many drug targets of interest, 
25 three-dimensional structure models are not presently 

available. Although methods for structure determination 
are evolving, it is currently difficult, costly and time 
consuming to determine the structure of a macromolecule 
drug target at sufficient resolution to render structure- 
30 based drug design practical. It can often be even more 
difficult to produce a macromolecule -ligand complex in a 
condition allowing a sufficiently resolved structure 



WO 03/023392 



PCT/US02/28745 



3 

model of the complex. The typically long time period 
required to obtain structure information useful for 
developing drug candidates is particularly limiting with 
regard to exploiting the growing number of potential drug 
5 targets identified by genomics research. 

Thus, there exists a need for methods to reduce 
the size and diversity of candidate libraries required to 
screen for lead compounds. The present invention 
satisfies this need and provides related advantages as 
10 well. 



SUMMARY OF THE INVENTION 

The invention provides a method for obtaining a 
focused library of candidate binding compounds for a 

15 protein family, wherein the members of the protein family 
bind a common ligand. The method includes the steps of 
(a) observing competitive binding of the common ligand 
and a first ligand to a protein, wherein the protein is a 
member of the protein family, thereby determining that 

20 the first ligand binds to the common ligand binding site 
of the protein; (b) providing a sample containing the 
protein, a first ligand and a second ligand under 
conditions wherein the first ligand, the second ligand 
and the protein form a bound complex; (c) detecting 

25 magnetization transfer between the first ligand and the 
second ligand in the bound complex, thereby determining 
that the two ligands are proximal in the bound complex; 
and (d) obtaining a population of candidate binding 
compounds including the first ligand, or a fragment 

30 thereof, linked to one of a plurality of second ligand 
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homologs, whereby the population contains binding 
compounds that bind to members of the protein family. 

The invention further provides a method for 
obtaining a focused library of candidate binding 
5 compounds, wherein the members of the protein family bind 
a common ligand. The method includes the steps of (a) 
providing a plurality of samples containing the protein 
and a first ligand under conditions wherein the first 
ligand and the protein form a bound complex, wherein the 

10 protein is a member of a family of proteins that bind a 
common ligand; (b) assaying a population of candidate 
second ligands for the ability to transfer magnetization 
to the first ligand in a sample from the plurality; (c) 
identifying, from the population of candidate second 

15 ligands, a second ligand that transfers magnetization to 
the first ligand, thereby determining that the two 
ligands are proximal to each other in a ternary bound 
complex with the protein; (d) observing competitive 
binding between one of the two ligands and the common 

20 ligand, thereby determining that the competitive binding 
ligand binds to the common ligand binding site of the 
protein; and (e) obtaining a population of candidate 
binding compounds including the competitive binding 
ligand, or a fragment thereof, linked to one of a 

25 plurality of homologs of the other ligand, whereby the 
population of candidate binding compounds contains 
binding compounds that bind to members of the protein 
family. 

The invention further provides a method for 
30 obtaining a focused library of candidate binding 

compounds for a protein family, wherein the members of 
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the protein family bind a common ligand. The method 
includes the steps of: (a) providing a ligand-probe 
having an antenna moiety, wherein the ligand-probe binds 
to the common ligand binding site of a protein, wherein 
5 the protein is a member of the protein family; (b) 

providing a sample containing the protein, the ligand- 
probe and a second ligand under conditions wherein the 
ligand-probe, the second ligand and the protein form a 
bound complex; (c) detecting magnetization transfer 

10 between the antenna moiety of the ligand-probe and the 
second ligand in the bound complex, thereby determining 
that the antenna moiety and second ligand are proximal in 
the bound complex; and (d) obtaining a population of 
candidate binding compounds comprising the ligand-probe, 

15 or a fragment thereof, linked to one of a plurality of 
second ligand homologs, whereby the population contains 
binding compounds that bind to members of the protein 
family. 

Also provided is a method for obtaining a 
20 focused library of candidate binding compounds, wherein 
the members of the protein family bind a common ligand. 
The method includes the steps of: (a) providing a 
ligand-probe having an antenna moiety, wherein the 
ligand-probef binds to the common ligand binding site of a 
25 protein, wherein the protein is a member of the protein 
family; (b) providing a plurality of samples containing 
the protein and the ligand-probe under conditions wherein 
the ligand-probe and the protein form a bound complex, 
wherein the protein is a member of a family of proteins 
30 that bind a common ligand; (c) assaying a population of 
candidate second ligands for the ability to transfer 
magnetization to the antenna moiety of the ligand-probe 



WO 03/023392 



PCT/US02/28745 



6 

in a sample from the plurality; (d) identifying, from the 
population of candidate second ligands, a second ligand 
that transfers magnetization to the antenna moiety of the 
ligand-probe, thereby determining that the two ligands 
5 are proximal to each other in a ternary bound complex 
with the protein; and (e) obtaining a population of 
candidate binding compounds comprising the ligand-probe, 
or a fragment thereof, linked to one of a plurality of 
homologs of the other ligand, whereby the population of 
10 candidate binding compounds contains binding compounds 
that bind to members of the protein family. 

Further provided is a method for identifying a 
compound having specificity for a particular member of a 
protein family, compared to other members of the protein 

15 family, prior to synthesizing the compound. The method 
includes the steps of: (a) observing competitive binding 
of the common ligand and a first ligand to a first 
protein, (b) observing competitive binding of the common 
ligand and a first ligand to a second protein, wherein 

20 the first and second proteins are members of the protein 
family, thereby determining that the first ligand binds 
to the common ligand binding site of the first and second 
proteins; (c) providing a sample containing the first 
protein, the first ligand and a second ligand; (d) 

25 providing a sample containing the second protein, the 
first ligand and the second ligand; (e) comparing the 
degree of magnetization transfer between the first ligand 
and the second ligand for the samples of parts (b) and 
(c) ; and (f ) obtaining a binding compound including the 

30 first ligand, or a fragment thereof, linked to the second 
ligand, or a fragment thereof, whereby the binding 
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compound selectively binds the first protein compared to 
the second protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic diagram of a method 
5 for sequentially building a binding compound from three 
ligands. 

Figure 2 shows structures for ligands that are 
proximal to PBBA when bound to p38ot MAP kinase, where the 
PBBA terminal methyl is represented by an asterisk in 
10 part b and atoms that have NOE interactions are indicated 
by arrows. 

Figure 3 shows NMR NOESY spectra for PBBA and 
inhibitor TTM2001 . 082 .B09 when bound to p38a (left panel) 
and structures for PBBA and inhibitor TTM2001 . 082 .B09 
15 (right panel) . 

Figure 4 shows the structure of the 
TTM2001.101.A09 bi-ligand and 2 HNMR spectra for the 
TTM2001.101.A09 bi-ligand in the absence (a) and presence 
(b) of 10 pM p38a MAP kinase. 

2 0 Figure 5 shows IC 50 values for inhibition of 

myelin basic protein phosphorylation by p38oc MAP kinase 
in the presence of inhibitor TTM2001 . 082 .B09, PBBA or 
TTM2 001. 101. AO 9 , respect i vely . 



Figure 6 shows structures for PBBA analogs that 
25 bind to p38a MAP kinase. Atoms that have NOE 
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interactions with TTM2001 . 070 . A10 are indicated by 
arrows . 

Figure 7 shows NMR NOESY spectra for PBBA and 
SB203580 when bound to p38a and structures for PBBA and 
5 SB203580 (upper panel) and a crystallographic structure 
model of the SB203580/p38a complex where the PBBA binding 
region is indicated by the white oval (lower panel) . 

Figure 8 shows a schematic diagram of the 
relative locations where ATP (dark circle) , myelin basic 
10 protein (white circle) , peptide (grey circle) and PBBA 
(area within the white circle and indicated by brackets) 
bind to p38oc MAP kinase . 

Figure 9 shows a structure model of p38a 
derived from Wang et al . Structure 6:1117-1128 (1998) in 
15 which residues are colored to indicate homology within 

the family of p38a-like proteins. The pentagon indicates 
the location of inhibitor SB203580 binding and the white 
circle indicates the location of PBBA binding, both 
determined from docking simulations. 

20 Figure 10 shows the structures of 

TTM2002.143.A27 and TTE0020 . 003 . A05 with NOE interactions 
indicated by arrows. 

Figure 11 shows the structures of 
TTE0020.003.A09 and TTM2002 . 143 . A27 with NOE interactions 
25 indicated by arrows. 
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Figure 12 shows the structures of 
TTM2002.143.A27 and TTE0020 . 002 .H10 with NOE interactions 
indicated by arrows. 

DETAILED DESCRIPTION OF THE INVENTION 



5 This invention provides a* method for 

identifying a compound that will bind to a macromolecule . 
Using a method of the invention, the relative positions 
of two or more ligands when bound to a macromolecule in a 
multipartite complex can be determined. Based on this 

10 determination, the ligands or portions of the ligands can 
be covalently linked to form a binding compound. An 
advantage of the invention is that atomic-resolution 
structural data for a macromolecule, although useful in 
some aspects of the invention, is not necessary in order 

15 to obtain a compound that binds to the macromolecule. 



The methods can also be used to design a 
library that is focused toward members of a particular 
protein family having a common ligand site (CLS) . The 
focused library can include compounds with various 

2 0 combinations of linked moieties, where the moieties are 

structurally similar to each ligand observed in a 
multipartite complex and the linker between the moieties 
is selected based on the relative positions of the 
ligands in the multipartite complex. A focused library 
25 can be designed by determining the relative positions of 
two or more ligands when bound to a macromolecule in a 
multipartite complex, identifying which of the ligands is 
a common ligand capable of binding to the CLS and 
building the library to contain members having the common 

3 0 ligand linked to various moieties that are structurally 
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similar to the other ligand. An advantage of the 
invention is that screening with one or a representative 
subset of proteins in a family can be used to design a 
library that is focused with respect to other proteins in 
5 the family. 

Nuclear Magnetic Resonance (NMR) can be used in 
a method of the invention to determine the relative 
proximity or positions of ligands in a multipartite 
complex with a macromolecule . In particular, proximal 

10 ligands can be identified from NMR-based observation of 
magnetization transfer between the ligands. Although NMR 
methods have been previously used to predict or determine 
the structure of ligands bound to macromolecules, these 
methods have relied upon detection of magnetic 

15 interactions between the ligand and the macromolecule. 
Isotopic labeling can be required for macromolecules in 
order to detect magnetic interactions with a bound 
. ligand. Furthermore, for many large or membrane bound 
macromolecules signal broadening, due in part to low 

20 rotational mobility, renders detection of magnetic 
interactions with ligands impractical. Because the 
methods of the present invention are based on detection 
of interactions between ligands and do not require 
detection of interactions with macromolecule, 

25 isotopically labeled macromolecules are not necessary. 
The methods are further advantageous for use with large 
or membrane bound macromolecules because observation of 
magnetization transfer between ligands can be enhanced 
when the ligands experience low rotational mobility. 

30 A further advantage of the invention is that 

ligands having relatively low affinities for a 
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macromolecule can be identified and linked to form a 
compound having substantially increased affinity for the 
macromolecule. Such increased affinity is expected to 
occur, for example, due to the chelate effect (for a 
5 description of the chelate effect see Page et al . , Proc. 
Natl, Acad. Sci. USA 68:1678-1683 (1971)) and is 
demonstrated in the Examples below. Another advantage of 
the invention is that a compound having increased 
specificity for a particular macromolecule, compared to a 

10 ligand from which it is assembled, can be identified. In 
particular, members of a CLS- containing protein family 
often have a different specificity ligand site adjacent 
to the common ligand binding site which provides a 
potential source of binding specificity (as described, 

15 for example, in U.S. Pat. App. No. 09/328,322 and WO 

99/60404) . By linking the common ligand to a particular 
specificity ligand, a compound can be obtained that has 
increased affinity due to the presence of both ligands 
and increased specificity for a particular member of a 

20 protein family compared to the common ligand. 

A ligand can include an antenna moiety that 
extends from the core structure of the ligand to interact 
with a proximal ligand. An antenna moiety can extend the 
range within which a proximal ligand is identified. The 

25 use of a mult i -probe ligand having multiple antennas or 
comparison of ligands having antennas of different 
composition or point of attachment on the ligand moiety 
can provide information on the relative orientation of 
the proximal ligands and their binding sites. Because 

30 the methods of the invention provide not only a 

functional identification that a ligand binds to a 
macromolecule, but also identify the relative positions 
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of two ligands when bound to the macromolecule, the 
invention provides structural information. Use of a 
method of the invention in a screening format provides a 
way to increase the throughput at which structural 
5 information can be obtained on the relative orientation 
of the proximal ligands and their binding sites. 

In the following description, for the purposes 
of explanation, specific details are set forth in order 
to provide a thorough understanding of the present 

10 invention. Those skilled in the art will understand that 
the present invention can be practiced without these 
specific details and can be applied to any of a variety 
of related systems. For example, although the methods 
are described in the context of ligands that bind to a 

15 protein, it is understood that the methods can be applied 
to other macromolecules including, for example, synthetic 
polymers, DNA, RNA or polysaccharides that interact with 
ligands . 

As used herein, the term "ligand" is intended 
20 to mean a molecule that can form a specific, non-covalent 
association with a macromolecule. A molecule included in 
the term can be a small molecule, a binding compound or a 
macromolecule. A molecule included in the term can be 
naturally occurring such as a DNA, RNA, polypeptide, 
25 protein, lipid, carbohydrate, amino acid, nucleotide, 
metabolite or hormone; a synthetic molecule; or a 
derivative of a naturally occurring molecule. A 
derivative can have, for example, an added moiety, a 
removed moiety or a rearrangement in the relative 
30 location of moieties compared to a naturally occurring 

molecule. As used herein, the term "binding compound'' is 
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intended to mean a ligand having a covalent structure 
that includes at least two moieties that interact with a 
macromolecule. 

As used herein, the term "binding site" is 
5 intended to mean a portion of a macromolecule or complex 
of macromolecules that associates specifically and non- 
covalently with a ligand or portion of a ligand. A non- 
covalent association included in the term can be due to a 
hydrogen bond, ionic interaction, van der Waals 
10 interaction, or hydrophobic interaction or a combination 
thereof . 

As used herein, the term "competitive binding" 
is intended to mean binding of a first ligand to a 
binding site of a macromolecule in a manner that prevents 
15 a second ligand from binding to the binding site. 

Accordingly, a first and second ligand that bind to a 
binding site of a macromolecule in a mutually exclusive 
manner are' understood to be competitive inhibitors of 
each other for the macromolecule, 

20 ^ As used herein, the term "bound complex" is 

intended to mean a specific non-covalent association 
between 2 or more molecules. The term can include a 
reversible association so long as the association is 
sufficiently stable to be observed by a binding assay. 

25 As used herein, the term "common ligand" or 

"CL" is intended to mean a molecule that specifically 
binds at a site conserved in a family of 2 or more 
macromolecules. The term can therefore extend to 
molecules that bind to members of a protein family or 
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gene family. Examples of common ligands include a 
natural common ligand which is normally found in 
biological systems or a common ligand mimic which has 
sufficient structural similarity to a natural common 
5 ligand that it can competitively inhibit binding of the 
natural common ligand to its common ligand binding site. 
Accordingly, a "common ligand site 7 ' is intended to mean a 
location in or on a macromolecule where a common ligand 
binds. A common ligand site is also referred to as a 
10 conserved site. 

The term "mimic , " when used in reference to a 
ligand, is intended to mean a molecule that binds to a 
protein at the same site as the ligand. The term can 
encompass molecules having portions similar to 
15 corresponding portions of the ligand in terms of 

structure or function. The term can also encompass the 
original ligand itself. 

An example of a useful CL is a cofactor or a 
cof actor mimic. A "cofactor" is any small molecule that 

20 binds in the CL site and participates in catalysis when 
bound to an enzyme. Cof actors often contain a nucleotide 
such as adenine mononucleotide or nicotinamide 
mononucleotide. Examples of such cof actors include ATP, 
ADP and SAM (S-adenosyl methionine) . Another group of 

25 cof actors that contain a nucleotide is the group NAD + , 
NADH, NADP* and . NADPH . Other such cof actors include 
FMNH 2 , FMN, FAD, FADH 2 , CoA, GTP and GDP. Still other 
cofactors include THF, DHF, TPP, biotin, dihydropterin, 
heme, pyridoxal phosphate and thiamine pyrophosphate. 

3 0 Other common ligands include conserved ligands such as 
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f arnesyl , f arnesyl -pyrophosphate , geranyl , geranyl - 
pyrophosphate or ubiquitin. 



As used herein, the term "family, " when used in 
reference to a macromolecule, is intended to mean a group 
5 of at least 2 macromolecules exhibiting structure 
homology and at least one function in common. An 
exemplary function included in the term is the ability to 
bind a common ligand such as NADH or ATP. Examples of 
enzyme families include kinases, dehydrogenases, 

10 oxidoreductases, GTPases, carboxyl transferases, acyl 
transferases , decarboxylases , transaminases , racemases , 
methyl transferases, formyl transferases, and 
a-ketodecarboxylases. As used herein, the term "enzyme 11 
refers to a molecule that binds a substrate ligand and 

15 carries out a catalytic reaction by converting the 
substrate ligand to a product. 



Enzymes can also be classified based on Enzyme 
Commission (EC) nomenclature recommended by the 
Nomenclature Committee of the International Union of 

2 0 Biochemistry and Molecular Biology (IUBMB) and available 

from the ENZYME database, (available on the internet at 
expasy.ch/ enzyme/; administered by The Swiss Institute 
for Bioinformatics, Switzerland; see, for example, 
Bairoch, Nucl . Acid. Res. 28:304-305 (2000)). For 
25 example, oxidoreductases are classified as 

oxidoreductases acting on the CH-OH group of donors with 
NAD + or NADP + as an acceptor (EC 1.1.1); oxidoreductases 
acting on the aldehyde or oxo group of donors with NAD + or 
NADP+ as an acceptor (EC 1.2.1); oxidoreductases acting on 

3 0 the CH-CH group of donors with NAD* or NADP + as an 

acceptor (EC 1.3.1); oxidoreductases acting on the CH-NH 2 
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group of donors with NAD* or NADP + as an acceptor (EC 
1.4.1); oxidoreductases acting on the CH-NH group of 
donors with NAD + or NADP + as an acceptor (EC 1.5.1); 
oxidoreductases acting on NADH or NADPH (EC 1.6); and 
5 oxidoreductases acting on NADH or NADPH with NAD + or NADP + 
as an acceptor (EC 1.6.1). 

Additional oxidoreductases include 
oxidoreductases acting on a sulfur group of donors with 
NAD + or NADP + as an acceptor (EC 1,. 8.1); oxidoreductases 

10 acting on diphenols and related substances as donors with 
NAD* or NADP+ as an acceptor (EC 1.10.1); oxidoreductases 
acting on hydrogen as donor with NAD + or NADP + as an 
acceptor (EC 1.12.1); oxidoreductases acting on paired 
donors with incorporation of molecular oxygen with NADH 

15 or NADPH as one donor and incorporation of two atoms (EC 
1.14.12) and with NADH or NADPH as one donor and 
incorporation of one atom (EC 1.14.13); oxidoreductases 
oxidizing metal ions with NAD + or NADP + as an acceptor (EC 
1.16.1); oxidoreductases acting on -CH 2 groups with NAD + 

20 or NADP + as an acceptor (EC 1.17.1); and oxidoreductases 
acting on reduced ferredoxin as donor, with NAD + or NADP+ 
as an acceptor (EC 1.18.1). 

Other enzymes include transferases classified 
as transferases transferring one-carbon groups (EC 2.1); 
25 methyltransf erases (EC 2.1.1); hydroxymethyl- , formyl- 
and related transferases (EC 2.1.2); carboxyl- and 
carbamoyl transf erases (EC 2.1.3); acyltransf erases (EC 
2.3); and transaminases (EC 2.6.1). Additional enzymes 
include phosphotransferases such as phosphotransferases 
0 transferring phosphorous -containing groups with an 
alcohol as an acceptor (kinases) (EC 2.7.1); 
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phosphotransferases with a carboxyl group as an acceptor 
(EC 2.7.2); phosphotransf er with a nitrogenous group as 
an acceptor (EC 2.7.3); phosphotransferases with a 
phosphate group as an acceptor (EC 2.7.4); and 
5 diphosphotransf erases (EC 2.7.6) . 

Protein or gene family members can often be 
identified by the presence of a conserved structural 
motif as described, for example, in Branden and Tooze 
Introduction to Protein Structure . Garland Publishing 

10 Inc., New York (1991). A structural motif can be 

identified at the primary structure level according to a 
particular nucleotide or amino acid sequence or at the 
tertiary structure level due to a particular combination 
or orientation of secondary structure elements. 

15 Identification of structural motifs using structural 
alignments is described in further detail below. 

Several large protein and gene families have 
been identified, including families having as many as 2 0 
or more, 50 or more, 100 or more and even 200 or more 

20 members. Two particular examples of a protein or gene 
family are kinases and oxidoreductases . The term 
"kinase" herein means any enzyme that catalyzes the 
transfer of a phosphoryl group from ATP or other 
nucleoside triphosphate to another compound. The term 

25 "oxidoreductase" herein means any enzyme that catalyzes 
an oxidation-reduction reaction. Still other gene 
families include transaminases, decarboxylases and 
methyltransf erases . 

Another particular gene family is the 
3 0 dehydrogenase gene family. The term "dehydrogenase" 
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herein means any enzyme that catalyzes the removal of 
hydrogen from a substrate using a compound other than 
molecular oxygen as an acceptor. Typically the hydrogen 
is transferred to the coenzyme NAD* (nicotinamide adenine 
5 dinucleotide) or NADP + (nicotinamide adenine dinucleotide 
phosphate) . The dehydrogenase gene family is large, 
containing approximately 17% of .all enzymes (You, 
Kwan-sa, u Stereospecif icity for Nicotinamide Nucleotides 
in Enzymatic and Chemical Hydride Transfer Reactions/' 
10 CRC Crit. Rev. Biochem. 17:313-451 (1985)). Thus, the 
dehydrogenase family is likely to be a rich source of 
drug targets. 

As used herein, the term "specificity" refers 
to the ability of a ligand to selectively bind to one 

15 macromolecule over another. For example, the term can 
include selective binding of a ligand to one member of a 
protein family compared to other proteins outside of or 
within the protein family. The selective binding of a 
particular ligand to a macromolecule is measurably higher 

20 than the binding of the ligand to at least one other 

molecule. Specificity can also be exhibited over two or 
more, three or more, four or more, five or more, six or 
more, seven or more, ten or more, or even twenty or more 
other macromolecules . 

25 As used herein, the term "structure model" is 

intended to mean a representation of the relative 
locations of atoms of a molecule. A representation 
included in the term can be defined by a coordinate 
system that is preferably in 3 dimensions, however, 

30 manipulation or computation of a model can be performed 
in 2 dimensions or even 4 or more dimensions in cases 
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where such methods are desired. The location of atoms in 
a molecule can be described, for example, according to 
bond angles, bond distances, relative locations of 
electron density, probable occupancy of atoms at points 
5 in space relative to each other, probable occupancy of 
electrons at points in space relative to each other or 
combinations thereof. A representation included in the 
term can contain information for all atoms of a 
particular molecule or a subset of atoms thereof. 

10 Examples of representations included in the term that 
contain a subset of atoms are those commonly used for 
polypeptide structures such as ribbon diagrams, and the 
like, which show the coordinates of the polypeptide 
backbone while omitting coordinates for all or a portion 

15 of the side chain moieties of the polypeptide. 

Representations for other macromolecules and small 
molecules included in the term can similarly contain all 
or a subset of atoms. 

A structure model can include a representation 
20 that is determined from empirical data derived from, for 
example, X-ray crystallography or nuclear magnetic 
resonance spectroscopy. A representation included in the 
term can include one that is derived from a theoretical 
calculation including, for example, a structure obtained 
25 by homology modeling or aJb initio modeling. A 

representation of a structure model can include, for 
example, an electron density map, atomic coordinates, 
x-ray structure model, ball and stick model, density map, 
space filling model, surface map, Connolly surface, Van 
30 der Waals surface or CPK model. 
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As used herein, the term "docking 11 is intended 
to mean using a model of a first and second molecule to 
simulate association of the first and second molecule at 
a proximity sufficient for at least one atom of the first 
5 molecule to be within bonding distance of at least one 
atom of the second molecule. The term is intended to be 
consistent with its use in the art pertaining to 
molecular modeling. A model included in the term can be 
any of a variety of known representations of a molecule 
10 including, for example, a graphical representation of its 
three-dimensional structure, a set of coordinates, set of 
distance constraints, set of bond angle constraints or 
set of other physical or chemical properties or 
combinations thereof. 

15 As used herein, the term "magnetization 

transfer" is intended to mean a through-space alteration 
of the nuclear magnetic resonance properties of an atomic 
nucleus of a first atom due to a proximal atomic nucleus 
or at least one electron of a proximal atom. An 

20 alteration included in the term can occur due to the 
Nuclear Overhauser Effect (NOE) or cross saturation. 
Proximal atomic nuclei included in the term are those 
that are within a distance sufficient to cause a magnetic 
interaction detectable by a nuclear magnetic resonance 

25 spectroscopy measurement used in the methods of the 

invention. Examples of magnetic effects included in the 
term are a relaxation effect which can be detected for 
atoms that are about 10 A apart or closer, the Nuclear 
Overhauser Effect which can be detected for atoms that 

30 are about 6 A apart or closer or chemical shift due to 

shielding or de-shielding which can be detected for atoms 
that are about 10 A or closer. Atoms that are about 5 A 
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apart or closer, 4 A apart or closer, 3 A apart or 
closer, 2 A apart or closer or 1 A apart or closer are 
also proximal atoms that are included in the term. 

As used herein, the term "linker" is intended 
5 to mean one or more atoms that covalently connect a first 
moiety to a second moiety. A moiety included in the term 
can be a ligand such as a common ligand, or fragment 
thereof; a specificity ligand, or fragment thereof; or a 
mimic of a common ligand or specificity ligand. A linker 
10 can provide positioning and orientation of a first moiety 
relative to a second moiety such that one moiety can bind 
to a first ligand site and the other moiety can bind to a 
second, proximal site on a macromolecule . 

As used herein, a "library" is intended to mean 

15 a population of different molecules. The library is 
chemically synthesized and contains primarily the 
components generated during the synthesis. A population 
included in the term can include two or more different 
molecules. A population can be as large as the number of 

20 individual molecules currently available to the user or 
able to be made by one skilled in the art. A population 
can be as small as two molecules and as large as 10 10 
molecules. Generally, a population will contain two or 
more, three or more, five or more, nine or more, ten or 

25 more, twelve or more, fifteen or more, or twenty or more 
different molecules. A population can also contain tens 
or hundreds of different molecules or even thousands of 
different molecules. For example, a population can 
contain about 20 to about 100,000 different molecules or 

30 more, for example about 25 or more, 3 0 or more, 40 or 
more, 50 or more, 75 or more, 100 or more, 150 or more, 
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200 or more, 300 or more, 500 or more, or 1000 or more 
different molecules, and particularly about 10,000, 
100,000 or even IxlO 6 or more different molecules. A 
population of synthetic compounds can be derived, for 
5 example, by chemical synthesis and is substantially free 
of naturally occurring substances. 

As used herein, the term "homolog" is intended 
to mean a molecule or moiety of a molecule that has 
similar structure in comparison to a reference molecule 
10 or moiety. A moiety is a group of atoms that form a part 
or portion of a larger molecule. A moiety can consist of 
any number of atoms in a portion of a molecule and can 
correlate with a physical or chemical property conferred 
upon the molecule by the combined atoms. 

15 As used herein, the term "ligand- probe" is 

intended to mean a molecule that can selectively bind a 
protein and that has an antenna moiety and a ligand 
moiety. A "ligand moiety" is a fragment of a ligand - 
probe, that when lacking the antenna moiety, is capable 

2 0 of selectively binding to the protein. An "antenna 
moiety" is a structure containing an NMR- observable 
nucleus that is attached to a ligand moiety by bonding to 
at least 1 intervening atoms. A larger number of atoms 
can intervene between an NMR-observable nucleus and 

25 ligand moiety including, for example, at least 2, 3, 4, 
5, 6, 7, 8, 9, 10 or more intervening atoms. The 
intervening atoms can form an aliphatic chain that, when 
attached to a ligand moiety having aromatic rings, allows 
selective excitation due to differences in frequency for 

30 excitation or saturation of aliphatic and aromatic 

protons. An NMR- observable nucleus of an antenna moiety 
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can be a proton that is isolated from vicinal proton 
coupling. Isolation from vicinal proton coupling 
provides for selective observation of direct NOE transfer 
at short mixing times compared to indirect NOE transfer 
5 via spin diffusion and also reduces signal loss due to 
relaxation effects that occur for vicinal coupled 
protons. A proton can be isolated from vicinal proton 
coupling by being attached to a carbon that is adjacent 
to an atom that lacks protons including, for example, an 
10 ether oxygen; carbonyl carbon; thioether, sulfone or 
sulfoxide sulfur; deuterated carbon or selenium, or by 
being attached to a carbon that is adjacent to an atom 
having fast exchanging protons such as a nitrogen at high 
pH. 

15 The invention provides a method for assembling 

a binding compound. The method includes the steps of (a) 
obtaining a sample containing a macromolecule, a first 
ligand and a second ligand under conditions wherein a 
bound complex is formed containing the first ligand, the 

20 second ligand and the macromolecule; (b) detecting 

magnetization transfer between the first ligand and the 
second ligand in the bound complex; (c) determining from 
the magnetization transfer the distance between an atom 
of the first ligand and an atom of the second ligand in 

25 the bound complex; and (d) obtaining a binding compound 
including the first ligand , or a fragment thereof, the 
second ligand, or a fragment thereof, and a linker, 
whereby the binding compound is capable of binding the 
macromolecule . 

30 The invention further provides a method for 

obtaining a focused library of candidate binding 
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compounds for a protein family, wherein the members of 
the protein family bind a common ligand. The method 
includes the steps of (a) observing competitive binding 
of the common ligand and a first ligand to a protein, 
5 wherein the protein is a member of the protein family, 
thereby determining that the first ligand binds to the 
common ligand binding site of the protein; (b) providing 
a sample containing the protein, a first ligand and a 
second ligand under conditions wherein the first ligand, 

10 the second ligand and the protein form a bound complex; 
(c) detecting magnetization transfer between the first 
ligand and the second ligand in the bound complex, 
thereby determining that the two ligands are proximal in 
the bound complex; and (d) obtaining a population of 

15 candidate binding compounds including the first ligand, 
or a fragment thereof, linked to one of a plurality of 
second ligand homologs, whereby the population contains 
binding compounds that bind to members of the protein 
family. 

20 A schematic overview that exemplifies a method 

for assembling a binding compound is shown in Figure 1. 
At step 1, a ligand, shown as F x , is identified based on 
the observation that it binds to a protein. The binding 
can be observed based on magnetization transfer between 

25 the protein binding site and the F x ligand. The F 1 ligand 
can be obtained from a screen in which a library of 
candidate ligands are tested for the ability to bind the 
protein. At step 2, the F 2 ligand is identified as 
binding to the protein at a location that is proximal to 

30 the F 1 ligand. Proximal ligands can be identified based 
on the observation of magnetization transfer between the 
F 1 ligand and F 2 ligand when in a complex with the 
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protein. The F 2 ligand can be identified from a screen 
using a library of candidate ligands. Based on the 
observed magnetization transfer a bi- ligand compound can 
be obtained in which the F x and F 2 ligands are linked. 
5 Alternatively, fragments or homologs of either ligand can 
be linked to form a bi-ligand. As shown in step 2', the 
bi-ligand compound can be used to identify a third 
proximal ligand, shown as ligand F 3 , and a tri -ligand can 
be subsequently obtained in which the F lt F 2 and F 3 
10 ligands are linked. These and related methods are set 
forth in further detail below. 

Initially, a macromolecule target such as a 
protein is identified for the development of a binding 
compound. In one embodiment, a macromolecule target for 

15 development of a therapeutic agent can be identified 
based on its presence in a pathogen or its association 
with a disease or condition. For example, a protein 
target present in a pathogen can be selected as the 
target to develop drugs effective in combating a disease 

20 caused by that pathogen. Any pathogen can be selected as 
a target organism. Examples of pathogens include, for 
example, bacteria, fungi or protozoa. 

Pathogenic bacteria useful as target organisms 
include Staphylococcus, Mycobacteria, Mycoplasma, 

25 Streptococcus, Haemophilus, Neisseria, Bacillus, 
Clostridium, Corynebacteria, Salmonella, Shigella, 
Vibrio, Campylobacter, Helicobacter, Pseudomonas, 
Legionella, Bordetella, Bacteriodes, Fusobacterium, 
Yersinia, Actinomyces , Brucella, Borrelia, Rickettsia, 

30 Ehrlichia, Coxiella, Chlamydia, and Treponema. 
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Pathogenic strains of Escherichia coli can also be target 
organisms. 

Binding compounds targeted to macromolecules in 
these pathogenic bacteria are useful for treating a 
5 variety of diseases including bacteremia, sepsis, 

nosocomial infections, pneumonia, pharyngitis, scarlet 
fever, necrotizing fasciitis, abscesses, cellulitis, 
rheumatic fever, endocarditis, toxic shock syndrome, 
osteomyelitis, tuberculosis, leprosy, meningitis, 

10 pertussis, food poisoning, enteritis, enterocolitis, 
diarrhea, gastroenteritis, shigellosis, dysentery, 
botulism, tetanus, anthrax, diphtheria, typhoid fever, 
cholera, actinomycosis, Legionnaire's disease, gangrene, 
brucellosis, lyme disease, typhus, spotted fever, 

15 Q fever, urethritis, vaginitis, gonorrhea and syphilis. 

For example, Staphylococcus aureus is a major 
cause of nosocomial infections and has become 
increasingly resistant to a variety of antibiotics over 
recent years. Similarly, Mycobacteria tuberculosis has 

20 become increasingly resistant to multiple antibiotics in 
recent years. M. tuberculosis infects almost one third 
of the world population, with active tuberculosis found 
in almost 10 million people worldwide and in AIDS 
patients as a common opportunistic infection. 

25 Streptomyces has also become increasingly resistant to 
antibiotics over recent years. Therefore, these 
pathogenic bacteria with known resistance and target 
macromolecules required for their growth or pathogenesis 
are particularly desirable as target organisms for which 

30 therapeutic binding compounds can be identified. 
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In another embodiment, target organisms are 
selected from yeast and fungi. Pathogenic yeast and 
fungi useful as target organisms include- Aspergillus, 
Mucor, Rhizopus, Candida, Cryptococcus , Blastomyces, 
5 Coccidioides , Histoplasma, Paracoccidioides, Sporothrix, 
and Pneumocystis. Binding compounds targeted to 
macromolecules in these pathogenic yeast and fungi are 
useful for treating a variety of diseases including 
aspergillosis , zygomycosis , candidiasis , cryptococcoses , 
10 blastomycosis , coccidioidomycosis , histoplasmosis , 

paracoccidioidomycosis, sporotrichosis, and pneuomocystis 
pneumonia . 

In still another embodiment, target organisms 
are selected from protozoa. Pathogenic protozoa useful 
15 as target organisms include Plasmodium, Trypanosoma, 
Leishmania, Toxoplasma, Cryptosporidium, Giardia, and 
Entamoeba. Binding compounds targeted to macromolecules 
in these pathogenic protozoa are useful for treating a 
variety of diseases including malaria, sleeping sickness, 

2 0 Chagas' disease, leishmaniasis, toxoplasmosis, 

cryptosporidiosis, giardiasis, and amebiasis. 

In addition, a target cell such as a cancer 
cell can be selected to identify drugs effective for 
treating cancer. Examples of such target cells include, 
25 for example, breast cancer, prostate cancer, and ovarian 
cancer cells as well as leukemia, lymphomas, melanomas, 
sarcomas and gliomas. Binding compounds directed to a 
target macromolecule in a cancer cell are useful for 
targeted delivering of a chemo therapeutic agent or for 

3 0 inhibition of unregulated growth. Diagnosis and 

identification of causative factors or pathogens for a 
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targeted disease can be determined using methods known in 
the art as described for example in The Merck Manual , 
Sixteenth Ed, (Berkow, R. , Editor) Rahway, N.J. ,1992. 

A macromolecule family to which a, target 
5 macromolecule belongs can be identified according to 

structural or functional similarities using methods known 
in the art. Structural similarity can be identified, for 
example, by sequence analysis at the nucleotide or amino 
acid level. One method for determining if two 

10 macromolecules are related is BLAST, Basic Local 

Alignment Search Tool, (available on the internet at 
ncbi.nlm.nih.gov/BLAST/; administered by The National 
Center for Biotechnology Information, Bethesda Maryland) . 
BLAST is a set of similarity search programs designed to 

15 examine all available sequence databases and can function 
to search for similarities in protein or nucleotide 
sequences. A BLAST search provides search scores that 
have a well-defined statistical interpretation. 
Furthermore, BLAST uses a heuristic algorithm that seeks 

20 local alignments and is therefore able to detect 

relationships among sequences which share only isolated 
regions of similarity (Altschul et al . , J. Mol . Biol. 
215:403-410 (1990) ) . 

In addition to the originally described BLAST 
25 (Altschul et al . , supra, 1990), modifications to the 

algorithm have been made (Altschul et al . , Nucleic Acids 
Res. 25:3389-3402 (1997)). One modification is Gapped 
BLAST, which allows gaps, either insertions or deletions, 
to be introduced into alignments. Allowing gaps in 
30 alignments tends to reflect biologic relationships more 
closely. A second modification is PSI -BLAST, which is a 
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sensitive way to search for sequence homologs. PSI-BLAST 
performs an initial Gapped BLAST search and uses 
information from any significant alignments to construct 
a position- specific score matrix, which replaces the 
5 query sequence for the next round of database searching. 
A PSI-BLAST search is often more sensitive to weak but 
biologically relevant sequence similarities. 

A second resource for identifying members of a 
protein family is PROSITE. (Available on the internet at 

10 expasy.ch/sprot/prosite.html; administered by The Swiss 
Institute for Bioinf ormatics, Switzerland) . PROSITE is a 
method of determining the function of uncharacterized 
proteins translated from genomic or cDNA sequences 
(Bairoch et al . , Nucleic Acids Res. 25:217-221 (1997)). 

15 PROSITE consists of a database of biologically 

significant sites and patterns that can be used to 
identify which known family of proteins, if any, the new 
sequence belongs. In some cases, the sequence of an 
unknown protein is too distantly related to any protein 

20 of known structure to detect its resemblance by overall 
sequence alignment. However, related proteins can be 
identified by the occurrence in its sequence of a 
particular cluster of amino acid residues, which can be 
called a pattern, motif, signature or fingerprint. 

25 PROSITE uses a computer algorithm to search for motifs 
that identify proteins as family members. PROSITE also 
maintains a compilation of previously identified motifs, 
which can be used to determine if a newly identified 
protein is a member of a known protein family. 

30 Members of a protein family can also be 

identified by clustering binding site structures or bound 
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ligand conformations as described, for example, in U.S. 
Patent Application No. 10/040,895. A sequence model such 
as a Hidden Markov Model, representing the frequency and 
order with which specific amino acids or gaps occur in 
5 the binding sites of protein family members can be used 
to search a sequence database and identify other members 
as described, for example, in U.S. Patent Application No. 
10/04 0,895. Members of a protein family can also be 
identified by clustering their sequence comparison 
10 signatures, where a sequence comparison signature for a 
protein is a string of pairwise comparison scores for the 
protein compared to the other proteins in a database as 
described, for example, in U.S. Patent Application No. 
10/032,395. 

15 Another resource for identifying members of a 

protein family is Structural Classification of Proteins 
(SCOP, Available on the internet at 

scop.mrc-lmb.cam.ac.uk/scop/, administered by Medical 
Research council, Cambridge, England, (which is 
20 incorporated herein by reference) . SCOP maintains a 
compilation of previously determined protein tertiary 
folds from which structural comparison can be made to 
identify protein family members having similar motifs 
(Murzin et al . , J. Mol. Biol. 247:536-540 (1995)). 
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Table 1. Databases for Identifying Protein Family Motifs 



SEARCHABLE MOTIF AND 
PATTERN DATABASES 



WEBSITES 



PROSITE 



expasy . hcuge . ch/sprot /prosi te . html 



5 BLOCKS 



blocks . f here . org/blocks search . html 



PRINTS 



biochem.ucl . ac .uk/bsm/dbbrowser/PRINT 
3/ PRINTS. html 



PIMA 



dot . imgen . bem . tmc . edu : 9 3 3 1 / seq- 
search/protein- search . html 



10 PRODOM 



protein. toulouse . inra . f r/prodom. html 



MOTIF AND PROFILE SEARCHES 

REGULAR EXPRESSION SEARCH 
PROFILESEARCH 

15 PATS CAN 

PATTERNFIND 



WEBSITES 

ibc . wust 1 . edu/ f pat/ 

segnet . dl . ac .uk/hhg/PR 
OFILESE.html 

c . mcs . anl . gov/home/ove 
rbeek/ Pat Scan/HTML/pat 
scan . html 

ulrec3 .unil .ch/softwar 
e/PATFND-mailf orm. html 
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PROFILE 
PMOTIF 

5 

HMMER 

WWW AND FTP SERVERS FOR 
SINGLE SEQUENCE EXHAUSTIVE 
10 DATABASE SEARCHES 

BLAST 
BLITZ 



lent i . med . umn . edu/MolB 

io_man/chp- 

10.html#HDRl 

alces . med . umn . edu/pmot 
if .html 

genome . wustl . edu/eddy/ 
HMMER/ 



WEBSITES 

ncbi . nlm . nih . gov/BLAST/ 

ebi.ac.uk/searches/blitz_in 
put . html 



FASTA 

15 



genome . ad . jp/ ideas/ f asta 
/f asta_genes .html 
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FTP ADDRESSES FOR MOTIF 
AND PROFILE SEARCH PROGRAMS 



WEBSITES 



BARTON'S FLEXIBLE PATTERNS 



geof f . biop . ox . ac . uk/ 



PROPAT 



mdc -berl in . de/ 



5 SOM 



f tp .mdc- 

berlin. de/pub/neural 



SEARCHWISE 



sable . ox . ac . uk/pub/use 
rs 



10 



PROFILE 



ftp.ebi .ac.uk/pub/soft 
ware/unix/ 



TPROFILESEARCH 



ftp . ebi . ac . uk/pub/sof t 
are/vax/egcg 



CAP 



ncbi .nlm.nih.gov/ pub/k 
oonin/cap 



15 Additional resources for identifying motifs of 

a protein family are shown in Table 1 . The websites 
cited therein are incorporated by reference. 



Conserved amino acids are evolutionarily 
conserved and carry out a common function. For example, 
20 the Rossman fold is a tertiary structural motif that 

includes GXXGXXG or GXGXXG and is present in enzymes that 
bind nucleotides (Brandon and Tooze, in Introduction to 
Protein Structure . Garland Publishing, New York (1991)). 
Enzymes that bind nucleotides such as NAD, NADP, FAD, 
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ATP, ADP, AMP and FMN contain the Rossman fold sequence 
motif (Creighton, Proteins: Structures and Molecular 
Principles , , p. 368, W.H. Freeman, New York (1984)). 
Additional conserved residues as well as different 
5 protein structures distinguish protein families that 

bind, for example, NAD from those that bind, for example, 
ATP . 

An example of a recognizable protein motif or 
fingerprint is found in dinucleotide binding proteins 

10 such as dehydrogenases (Rossman et al., in The Enzymes 

Vol 11, Part A , 3rd ed. ; Boyer, ed., pp. 61-102, Academic 
Press, New York (1975); Wierenga et al . , J. Mol . Biol. 
187:101-107 (1986); and Ballamacina, FASEB J. 10:1257- 
1269 (1996)). The fingerprint region contains a 

15 phosphate binding consensus sequence GXXGXXG or GXGXXG, a 
hydrophobic core of six small hydrophobic residues, a 
conserved, negatively charged residue that binds to the 
ribose 2 1 hydroxy 1 of adenine and a conserved positively 
charged residue (Bellamacina, supra) . 

20 Protein kinases also have recognizable motifs 

conserved among all known protein kinases (Hanks and 
Quinn, Methods Enzvmol . 200:28-62 (1991)). Eight 
invariant amino acid residues are conserved throughout 
the protein kinase family, including a conserved GXGXXG 

25 motif similar to that seen in dinucleotide binding 

proteins. A crystallographic molecular model of cyclic 
AMP-dependent protein kinase as well as other protein 
kinases showed that these conserved residues are nearly 
all associated with essential, conserved functions such 

30 as ATP binding and catalysis (Knighton et al . , Science 

253:407-414 (1991); and Knighton et al . , Science 253:414- 
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420 (1991)). Thus, conserved amino acid residues, which 
are common to members of a protein family, are 
recognizable as a motif critical for the structure, 
function or activity of a protein. 

5 Pyridoxal binding proteins also^ have 

recognizable motifs. One motif is GXGGXXXG, a second 
motif is KXEX 6 SXKX 5 _ 6 M, and a third motif is PXNPTG (Suyama 
et al., Protein Engineering 8:1075-1080 (1995)). 

A macromolecule family can be selected based on 
10 a conserved and recognizable structural motif such as a 
primary sequence motif, tertiary structure motif, or 
both. Members of a macromolecule family can also be 
recognized based on similar function. For example, a 
protein family can be identified based on the ability of 
15 its members to bind a natural common ligand that is 
already known. For example, it is known that 
dehydrogenases bind to dinucleotides such as NAD or NADP. 
Therefore, NAD or NADP are natural common ligands to a 
number of dehydrogenase family members. Similarly, 
20 kinases bind ATP, which is therefore a natural common 
ligand to kinases. Other natural common ligands of a 
macromolecule family can be the coenzymes and cofactors 
■ described above. 



After a target macromolecule is selected, the 
25 selected macromolecule or a functional fragment thereof 
can be isolated for use in the methods. A functional 
fragment of a macromolecule is a fragment that is capable 
of binding at least one ligand that is bound by the full 
length macromolecule. The macromolecule or fragment can 
30 be isolated from a native tissue or organism, from a 
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population of cells maintained in culture, or from a 
recombinant organism or cell culture. Methods for 
isolating a protein are known in the art and are 
described, for example, in Scopes, Protein Purification: 
5 Principles and Practice , 3 rd Ed. , Springer-Verlag, New 
York (1994); Duetscher, Methods in Enzymology , Vol 182, 
Academic Press, San Diego (1990); and Coligan et al . , 
Current protocols in Protein Science , John Wiley and 
Sons, Baltimore, MD (2000) . 



10 A target macromolecule can be cloned and 

expressed in a recombinant organism using methods that 
are known to those skilled in the art including, for 
example, polymerase chain reaction (PCR) and other 
molecular biology techniques (Dieffenbach and Dveksler, 

15 eds., PCR Primer: A Laboratory Manual , Cold Spring Harbor 
Laboratory Press, Plainview, NY (1995); Sambrook et al . , 
Molecular Cloning: A Laboratory Manual , 2nd ed., Cold 
Spring Harbor Laboratory Press, Plainview, NY (1989) ; 
Ausubel et al . , Current Protocols in Molecular Biology. 

20 Vols. 1-3 , John Wiley & Sons (1998)) , The gene or cDNA 
encoding the target macromolecule is cloned into an 
appropriate expression vector for expression in an 
organism such as bacteria, insect cells, yeast or 
mammalian cells . 



25 Appropriate expression vectors .include those 

that are replicable in eukaryotic cells and/or 
prokaryotic cells and can remain episomal or be 
integrated into the host cell genome. Suitable vectors 
for expression in prokaryotic or eukaryotic cells are 

30 well known to those skilled in the art as described, for 
example, in Ausubel et al., supra. Vectors useful for 
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regulatory elements including the SV40 early promoter, 
the cytomegalovirus (CMV) promoter, the mouse mammary 
tumor virus (MMTV) steroid-inducible promoter, Moloney 
5 murine leukemia virus (MMLV) promoter, and the like. A 
vector useful in the methods of the invention can 
include, for example, viral vectors such as a 
bacteriophage, a baculovirus or a retrovirus; cosmids or 
plasmids; and, particularly for cloning large nucleic 

10 acid molecules, bacterial artificial chromosome vectors 
(BACs) and yeast artificial chromosome vectors (YACs) . 
Such vectors are commercially available, and their uses 
are known in the art. One skilled in the art will know 
or can readily determine an appropriate promoter for 

15 expression in a particular host cell. 



If desired, a target protein can be expressed 
as a fusion with an affinity tag that facilitates 
purification of the target protein. For example, the 
target protein can be expressed as a fusion with a poly- 

20 His tag, which can be purified by metal chelate 

chromatography. Other useful affinity purification tags 
which can be expressed as fusions with the target protein 
and used to affinity purify the protein include, for 
example, a biotin, polyhistidine tag (Qiagen; Chatsworth, 

25 CA) , antibody epitope such as the flag peptide (Sigma/ St 
Louis, MO) , glutathione -S- transferase (Amersham 
Pharmacia; Piscataway, NJ) , cellulose binding domain 
(Novagen; Madison, WI) , calmodulin (Stratagene; San 
Diego, CA) , staphylococcus protein A (Pharmacia; Uppsala, 

3 0 Sweden) , maltose binding protein (New England BioLabs; 
Beverley, MA) or strep- tag (Genosys; Woodlands, TX) or 
minor modifications thereof. 
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A target macromolecule can be validated as a 
representative member of a macromolecule family. In some 
cases, the target macromolecule is well characterized 
with respect to its binding properties to a natural 
5 common ligand. However, if the target macromolecule is 
encoded by a new, relatively uncharacterized gene, the 
expressed target macromolecule can be tested to confirm 
that it binds the natural common ligand. Other common 
ligands of related macromolecule families, for example, 
10 other nucleotide binding macromolecules, or known ligand 
mimics can also be tested for binding to the target 
macromolecule . 

A target macromolecule can be further validated 
as a useful therapeutic target by determining if the 

15 selected target macromolecule is known to be required for 
normal growth, viability or infectivity of the target 
organism or cell. If it is unknown whether the target 
macromolecule is required for normal growth, viability, 
or infectivity, the target macromolecule can be 

20 specifically inactivated by gene knockout, in a model 
organism to determine if the macromolecule performs a 
critical function required for survival or infectivity of 
the organism or cell. Such a macromolecule providing a 
critical function is a good target for developing 

25 therapeutic agents. 

Methods for disrupting a gene to generate a 
knockout are well known in the art (Ausubel et al . , 
Current Protocols in Molecular Biology, Vols 1-3 , John 
Wiley & Sons (1998)). For example, transposable elements 
3 0 can be used to knockout a gene and test for the effect of 
the knockout on cell growth, viability or infectivity 
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(Benson and Goldman, J. Bacterid. 174:1673-1681 (1992); 
Hughes and Roth, Genetics 119:9-12 (1988); and Elliot and 
Roth, Mol. Gen. Genet. 213:332-338 (1988)). Methods for 
gene knockouts in protozoa have also been previously 
5 described (Wang, Parasitology 114:531-544 (1997); and Li 
et al, Mol. Biochem. Parasitol. 78:227-236 (1996)). 

Although use of the methods of the invention is 
exemplified herein with regard to proteins, it is 
understood that a method of the invention can be used for 

10 any other macromolecule that is capable of binding two or 
more ligands in proximity. Other macromolecules include, 
for example, biological polymers such as polysaccharides 
or polynucleotides or synthetic polymers such as plastics 
and mimics of biological polymers. A polynucleotide can 

15 be, for example, a ribozyme, ribosomal RNA or other RNA 
that is capable of binding a ligand such as a nucleotide. 

A method of the invention can include a step of 
identifying a common ligand. In some cases, a common 
ligand to a macromolecule family is already known. For 

20 example, NAD is a natural common ligand for 

dehydrogenases, and ATP is a natural common ligand for 
kinases. However, natural common ligands such as the 
coenzymes and cofactors often have limitations regarding 
their usefulness 1 as a starting compound. Substrates and 

25 cofactors often undergo a chemical reaction, for example, 
transfer of a group to another substrate or reduction or 
oxidation during the enzymatic reaction. However, it is 
desirable that a ligand to be used as a drug is not 
metabalizable. Therefore, a natural common ligand or a 

30 derivative thereof that is non-metabalizable is generally 
preferred as a common ligand. Examples of mimetics to 
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the common ligand NADH, for example cibacron blue, are 
described in Dye - Ligand Chromatography r Amicon Corp., 
Lexington MA (1980) . Numerous other examples of NADH- 
mimics, including useful modifications to obtain such 
5 mimics, are described in Everse et al. (eds.), The 

Pyridine Nucleotide Coenzymes , Academic Press, New York 
NY (1982) . 

A ligand that binds a macromolecule can be 
identified or characterized using a binding assay 

10 including, for example, an equilibrium binding analysis, 
competition assay, or kinetic assay as described in 
Segel, Enzyme Kinetics John Wiley and Sons, New York 
(1975), and Kyte, Mechanism in Protein Chemistry Garland 
Pub. (1995) . A common ligand can be identified by a 

15 competitive binding assay. For example, a macromolecule 
can be incubated in the presence of a known common ligand 
and a candidate common ligand, and the rate or extent to 
which the common ligand binds the macromolecule can be 
determined. Competitive binding between the common 

20 ligand and candidate ligand can be identified from a 

reduction in the rate or extent of binding of the common 
ligand to the macromolecule in the presence of the 
candidate ligand, compared to in the absence of the 
candidate ligand (see, for example, Segel, Enzyme 

25 Kinetics John Wiley and Sons, New York (1975)) . A 
candidate ligand that competes with the known common 
ligand for binding to the common ligand site on the 
macromolecule is identified as a new common ligand. 



Alternatively, absence of competitive binding 
30 of a ligand for the site on a protein to which a common 
ligand binds can be used to determine that a ligand binds 
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to a different location on the protein from the common 
ligand such as a site that is external to the common 
ligand binding site. A site that is external to the 
common ligand binding site on a protein can be, for 
5 example, a specificity ligand binding site. Binding at a 
specificity ligand binding site can be determined by 
competitive binding with a specificity ligand or mimic 
thereof. Even if the location where a ligand binds is 
not known, a determination that the ligand binds to a 

10 different location of a protein compared to a second 

ligand combined with information regarding proximity of 
the two ligands can be used to map the protein binding 
site, as demonstrated in Example III. A ligand site on a 
protein that is identified as external to a common ligand 

15 binding site provides a specificity target for which a 
binding compound can be designed. Such a binding 
compound designed to interact with the external site will 
typically show selective binding to the protein compared 
to other proteins that bind the same common ligand. 

20 In some cases, a common ligand has an intrinsic 

property that is useful for detecting whether it is 
bound. For example, the natural common ligand for 
dehydrogenases, NAD, has intrinsic fluorescence. 
Therefore, increased fluorescence in the presence of 

25 candidate common ligands due to displacement of NAD can 
be used to detect competition for binding of NAD to a 
target NAD binding macromolecule (Li and Lin, Eur . J . 
Biochem. 235:180-186 (1996); and Ambroziak and 
Pietruszko, Biochemistry 28; 5367-5373 (1989)). 



In other cases, when the common ligand does not 
have an intrinsic property useful for detecting ligand 
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binding, it can be labeled with a detectable moiety. For 
example, the natural common ligand for kinases, ATP, can 
be radiolabeled with 32 P, and the displacement of 
radioactive ATP from an ATP binding protein in the 
5 presence of a candidate ligand can be used to identify 
the candidate as a common ligand. Any detectable moiety, 
for example a radioactive or fluorescent label, can be 
added to a ligand so long as the labeled ligand can bind 
to its binding site on a macromolecule. 

1° A library of candidate ligands can be screened 

to identify a ligand that binds to a macromolecule, for 
example, at a common ligand site. Thus, a method of the 
, invention can include assaying a population of candidate 
first ligands for the ability to bind to a target 

15 macromolecule and identifying from the population of 

candidate first ligands a first ligand that binds to the 
macromolecule. A first ligand identified from such a 
screen can then be used to form a complex with the 
macromolecule and a second ligand that binds proximal to 

2 0 the first ligand can be identified. The screen can be 
performed by a competitive binding assay on a sample 
containing the macromolecule, a candidate first ligand 
and a known common ligand such that a first ligand can be 
identified as a common ligand by its ability to displace 

25 the known j common ligand. 

A library of candidate ligands can contain a 
broad range of compounds of various structures. However, 
the library of candidate ligands can also be focused on 
compounds that are more likely to bind to a particular 
30 site in a macromolecule. A focused library can be 
designed, for example, to have members that are 
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structural homologs of a natural common ligand or that 
contain moieties found in the common ligand. A library 
of candidate common ligands can also be chosen to include 
members having structural features that are commonly 
5 found in a particular class of ligands including, for 

example, a MOTIF library as described in Example II. The 
library of candidate common ligands can be a group of 
analogs or mimetics of the natural common ligand. 

One approach to identify a common ligand from a 

10 library of candidate ligands is to perform high 

throughput screening on a large library of molecules. 
The molecules can be obtained from an existing source 
such as a commercial or proprietary library or can be 
synthesized using a combinatorial synthetic method. The 

15 iterative approach to combinatorial synthesis is well- 
known in the art and is set forth, in general, in 
Houghten et al . , Nature, 354, 84-86 (1991); and Dooley et 
al., Science, 266, 2019-2022 (1994). In the iterative 
approach, for example, sublibraries of a molecule having 

20 three variable groups are made wherein the first variable 
is defined. Each of the compounds with the defined 
variable group is reacted with all of the other 
possibilities at the other two variable groups. These 
sub-libraries are each tested for binding to the target 

25 macromolecule to define the identity of the second 

variable in the sub-library having the highest affinity. 
A new sub- library with the first two variable positions 
defined is reacted again with all the other possibilities 
at the remaining undefined variable position. As before, 

30 the identity of the third variable position in the sub- 
library having the highest activity is determined with a 
binding assay using the target macromolecule. If more 



WO 03/023392 



PCT/US02/28745 



44 

variables exist, this process is repeated for all 
variables, yielding the compound with each variable 
contributing to the desired binding affinity in the 
screening process. Promising compounds from this process 
5 can then be synthesized on larger scale in traditional 
single -compound synthetic methods for further biological 
investigation. 

The -positional -scanning approach has been 
described for various organic libraries and for various 

10 peptide libraries (see, for example, R. Houghten et al. 
PCT/US91/08694 and U.S. Patent 5,556,762). In the 
positional scanning approach, sublibraries are made 
defining only one variable with each set of sublibraries 
and all possible sublibraires with each single variable 

15 defined (and all other possibilities at all of the other 
variable positions) are made and tested/ For example, 
one skilled in the art could synthesize libraries wherein 
2 fixed positions are defined at a time. From the 
testing of each single-variable defined library for 

20 binding to the target macromolecule, the optimum 

substituent at that position is determined, pointing to 
the optimum or at least a series of compounds having a 
maximum affinity. Thus, the number of sublibraries for 
compounds with a single position defined will be the 

25 number of different substituents desired at that 

position, and the number of all the compounds in each 
sublibrary will be the product of the number of 
substituents at each of the other variables. 



Once a library of candidate common ligands is 
30 selected, the library is screened, for example, by 

competition with a natural common ligand for binding to a 
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target macromolecule, to identify at least one common 
ligand in the library that binds to a conserved site in 
the target macromolecule. A common ligand identified by 
the screen is then further characterized with respect, to 
5 affinity for the target macromolecule. In some cases it 
is desirable to identify a common ligand that is not a 
high affinity ligand. Since the common ligand binds to 
multiple members of a macromolecule family, a high 
affinity common ligand would likely bind to other members 

10 of the family in addition to the target macromolecule. 

It can therefore be desirable in such cases to identify a 
common ligand having modest affinity, preferably at or 
below the affinity of the natural common ligand that 
binds to the same conserved site. Such a common ligand 

15 having modest affinity is then used as a starting 

compound for identifying a binding compound. Generally, 
a modest affinity ligand will have affinity for a 
macromolecule with an equilibrium dissociation constant 
of about 10" 2 to 10' 7 M, or about 10 -3 to 1CT 6 M. The 

20 equilibrium dissociation constant of a common ligand or 
other ligand for a target macromolecule can be greater 
than 1 x 10" 6 ML 



Another approach to identify a common ligand is 
to use the three-dimensional structure of a natural 

25 common ligand and search a database of commercially 
available molecules such as the Available Chemicals 
Directory (MDL Information Systems, Inc.; San Leandro CA) 
to identify candidate common ligands having similar shape 
or electrochemical properties of the natural common 

30 ligand. Methods for identifying similar molecules are 
well known in the art and are commercially available 
( Douce t and Weber, in Computer -Aided Molecular Design: 
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Theory and Applications , Academic Press, San Diego CA 
(1996); software is available from Molecular Simulations, 
Inc., San Diego CA) . A database can be searched, for 
example, by querying based on chemical property 
5 information or on structural information. In the latter 
approach, an algorithm based on finding a match to a 
template can be used as described, for example, in 
Martin, "Database Searching in Drug Design, " J, Med. 
Chem. 35:2145-2154 (1992) . 

10 Furthermore, if structural information is 

available for the conserved site in the macromolecule, 
particularly with a known ligand bound, compounds that 
fit the conserved site can be identified through 
computational methods (Blundell, Nature 384 Supp:23-26 

15 (1996)) . Using such an approach a common ligand can be 
identified by obtaining a structure model for the binding 
site of the macromolecule and docking a structure model 
of a candidate ligand with the structure model of the 
binding site. Algorithms available in the art for 

20 fitting a ligand structure to a protein binding site 

include, for example, DOCK (Kuntz et al . , J. Mol. Biol , 
161:269-288 (1982)) and INSIGHT 9 8 (Molecular Simulations 
Inc. , San Diego, CA) . 

A molecular structure can be conveniently 
25 stored in a computer readable medium and manipulated in a 
computer system using structural coordinates. Structural 
coordinates can occur in any format known in the art so 
long as the format can provide an accurate reproduction 
of the observed structure. For example, crystal 
30 coordinates can occur in a variety of file types such as 
.fin, .df, .phs, or .pdb as described for example in 
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McRee et al., Practical Protein Crystallography , Academic 
Press, San Diego 1993. Although the examples above 
describe structural coordinates derived from X-ray 
crystallographic analysis, one skilled in the art will 
5 recognize that structural coordinates can be in any 

format derived from or used in a method known in the art 
for determining molecular structure, 

A ligand that binds to a target protein can be 
identified using nuclear magnetic resonance methods. For 

10 example, ligand binding can be characterized 

qualitatively or quantitatively by measuring cross- 
saturation between the ligand and macromolecule when 
bound in a complex. An example of a cross -saturation 
method is WaterLOGSY in which selective water excitation 

15 is followed by NOE mixing such that magnetization is 

effectively transferred via the protein- ligand complex to 
the free ligand in a selective manner. Under these 
conditions the resonance of non-bound molecules have an 
opposite sign and tend to be weaker than the resonances 

20 for bound ligands. The macromolecule resonances can be 
suppressed with a double spin echo scheme, which also 
suppresses water, and for small and medium sized 
proteins, where double spin echo may not sufficiently 
suppress protein sequences, a T lp filter can be introduced 

25 into the pulse sequence prior to the acquisition period. 
Thus, the resonances for bound ligands can be readily 
resolved from unbound molecules and the target 
macromolecule. Accordingly, WaterLOGSY can be used to 
screen mixtures of potential ligands to identify those 

30 that bind to a target macromolecule, for example, in a 
screening format. WaterLOGSY is described in further 
detail in Dalvit et al . , J. Biomol . NMR 21:349-359 
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(2001) . Nuclear magnetic resonance can also be used to 
identify a ligand that binds a macromolecule by observing 
changes in line widths, relaxation rates or NOE values 
for a ligand upon binding to a macromolecule, as 
5 described, for example, in Ni et al . , Prog . Nucl . Maan . 
Reson. Spectrosc. 26:517-606 (1994). 

Two ligands that bind simultaneously to a 
macromolecule and in close proximity to each other can be 
identified in a method of the* invention. One of the 

10 ligands can be a common ligand, as set forth above. The 
second ligand can be any molecule that is capable of 
binding to the macromolecule in proximity with the first 
ligand at a site that is external to the common ligand 
binding site. In the case of an enzyme target, a 

15 substrate that is acted upon by a cofactor usually 

provides a reasonable candidate as a second ligand. In 
particular, the common ligand site and substrate site are 
most likely located in physical proximity to each other 
in an enzyme's three-dimensional structure to facilitate 

20 catalysis. In particular, the three-dimensional 

geometric relationship between the common ligand site and 
substrate ligand sites has been shown to be conserved in 
evolutionarily related dehydrogenases (Sem and Kasper, 
Biochemistry 31:3391-3398 (1992)). Although the 

25 relationship between the sites is conserved, the 

substrate site itself imparts molecular properties that 
distinguish the protein from other proteins in the same 
protein family. Thus, the substrate site is referred to 
as a "specificity site." The specificity site of a 

30 macromolecule provides a binding site for a ligand that 
selectively associates with the macromolecule compared to 
other macromolecules that are in the same common ligand- 
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binding family. A site that is external to a common 
ligand binding site such as a substrate specificity site 
can be exploited as a potential binding site for the 
identification of a ligand that has specificity for one 
5 macromolecule over another member of the same 

macromolecule family. A site that is external to a 
common ligand binding site such as a specificity site is 
distinct from the common ligand binding site in that the 
natural common ligand does not bind to the specificity 
10 site. 

A second ligand such as a specificity ligand 
can be identified using the above -described methods 
including, for example, a binding assay, a structural 
characterization or database search. In the case where 

15 one or both ligands are known to bind to a macromolecule, 
a method of the invention can be used to determine that 
the two ligands bind in proximity to each other. 
Furthermore, as set forth below the relative orientation 
of or distance between the two ligands can be determined 

20 and used to design a binding compound or a library of 
candidate binding compounds. 

A method of the invention can also be used in a 
screening format to identify a second ligand that is 
capable of binding to a macromolecule simultaneously with 

25 a first ligand and in proximity to the first ligand. 

Thus, a second ligand that has not been previously shown 
to bind to the macromolecule can be identified as being 
capable of binding the macromolecule based' on detection 
of an interaction with another ligand. A second ligand 

30 can be identified from a library of candidate ligands in 
a screening method. 
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Accordingly, the invention provides a method 
for obtaining a focused library of candidate binding 
compounds, wherein the members of the protein family bind 
a common ligand. The method includes the steps of (a) 
5 providing a plurality of samples containing the protein 
and a first ligand under conditions wherein the first 
, ligand and the protein form a bound complex, wherein the 
protein is a member of a family of proteins that bind a 
common ligand; (b) assaying a population of candidate 

10 second ligands for the ability to transfer magnetization 
to the first ligand in a sample from the plurality;' (c) 
identifying, from the population of candidate second 
ligands, a second ligand that transfers magnetization to 
the first ligand, thereby determining that the two 

15 ligands are proximal to each other in a ternary bound 
complex with the protein; (d) observing competitive 
binding between one of the two ligands and the common 
ligand, thereby determining that the competitive binding . 
ligand binds to the common ligand binding site of the 

20 protein; and (e) obtaining a population of candidate 
binding compounds including' the competitive binding 
ligand, or a fragment thereof, linked to one of a 
plurality of homologs of the other ligand, whereby the 
population of candidate binding compounds contains 

25 binding compounds that bind to members of the protein 
family. 

A library of second ligands can be obtained as 
set forth above. In the case of an enzyme target, the 
library can be designed based on the structure of a 
30 natural specificity ligand since a substrate that is 

acted upon by a cofactor is proximal to a common ligand, 
as set forth above. A population of second ligands can 
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also be designed to include members having structural 
features that are commonly found in a particular class of 
ligands including, for example, a MOTIF library as 
described in Example II. A population of second ligands 
5 to be used in a method of the invention can be 

synthesized using combinatorial methods similar to those 
set forth above. 

Thus, ligands that bind proximal to each other 
in a complex with a macromolecule can be identified by 

10 screening a library of candidate first ligands and a 
library of candidate second ligands. Accordingly, a 
method of the invention can include the steps of: (a) 
providing a protein that is a member of a family of 
proteins that bind a common ligand; (b) assaying a 

15 population of candidate first ligands for the ability to 
bind the protein; (c) identifying from the population of 
candidate first ligands a first ligand that binds to the 
• protein; (d) providing a plurality of samples containing 
the protein and the first ligand under conditions wherein 

20 the first ligand and the protein form a bound complex; 

(e) assaying a population of candidate second ligands for 
the ability to transfer magnetization to the first ligand 
in a sample from the plurality; (f) identifying, from the 
population of candidate second ligands, a second ligand 

25 that transfers magnetization to the first ligand, thereby 
determining that the two ligands are proximal to each 
other in a ternary bound complex with the protein; (g) 
observing competitive binding between one of the two 
ligands and the common ligand, thereby determining that 

30 the competitive binding ligand binds to the common ligand 
binding site of the protein; and (h) obtaining a 
population of candidate binding compounds including the 
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competitive binding ligand, or a fragment thereof, linked 
to one of a plurality of homologs of the other ligand, 
whereby the population of candidate binding compounds 
contains binding compounds that bind to members of the 
5 protein family. 

An advantage of designing a bi -ligand binding 
compound based on screening candidate ligand libraries is 
that the number of bi-ligand compounds that need to be 
synthesized and tested compared with a classic structure 

10 activity relationship (SAR) approach is reduced. For 
example, two libraries of 1000 ligands can be rapidly 
screened to identify a small number of ligands that bind 
a macromolecule. Pairs of identified ligands can then be 
combinatorially assayed for the ability to simultaneously 

15 bind the macromolecule . If two ligands are found to 

simultaneously bind the macromolecule, proximal atoms on 
each ligand can be determined to guide chemistry to link 
the two ligands, with a small number of different 
linkers, for example, 5. Thus, only 5 compounds need to 

20 be synthesized. In contrast, a more traditional SAR 
approach would be to synthesize all possible pairs, 
resulting in a library of about 1000 x 1000 x 5 = 5 
million compounds. 

Two ligands that bind simultaneously to a 
25 macromolecule and in close proximity to each other can be 
identified in a method of the invention by detecting 
magnetization transfer between the two ligands when bound 
in a ternary complex with the macromolecule. For 
example, as demonstrated in Example I, interactions 
30 between proximal ligands can be identified based on NOE 
crosspeaks observed in a 2D ^H/H) NOESY spectrum. In 
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the case of a (^^H) NOESY spectrum obtained for ligands 
in a ternary complex with a macromolecule, observation of 
cross peaks occurring at the chemical shift positions of 
the atoms from separate ligands indicate that the atoms 
5 are proximal. Such inter-ligand NOE peaks can be 

resolved from intra-ligand NOE signals by adjusting the 
mixing time in the NOESY pulse sequence. Because the 
strength of an NOE interaction between two protons is 
dependent on l/r 6 , where r is the distance between the two 
10 protons, and because most inter-ligand proximal protons 
will be further apart than intra-ligand proximal protons, 
the mixing, time can be increased to allow selective 
detection of inter-ligand NOE peaks compared to most 
intra-ligand and intra-protein NOE peaks. 

15 Typically magnetization transfer between 

proximal ligands is observed in a sample having a molar 
excess of ligands compared to the protein to which they 
bind. Because the lifetime of an alteration to a nucleus 
due to a magnetization transfer, such as an NOE 

20 interaction, is usually longer than the residence time 
for ligands in a complex with a protein, the number of 
ligand nuclei for which magnetization transfer is 
observed exceeds the number of protein molecules in the 
sample. Thus, the protein acts to turn over altered 

25 ligands to amplify the observed signal in a condition 

where the ligand is in excess. The amplified signals can 
be readily distinguished from signals arising from the 
protein. 



In cases where ligand is not in excess over the 
30 protein, a signal that arises from a ligand atom can be 
identified using known methods for assigning resonances. 
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Such signals can be differentiated from signals arising 
from other atoms in a sample using isotope enrichment. 
For example, where protons of a ligand are to be observed 
the protein to which the ligand binds can be labeled with 
5 deuterium ( 2 H) to remove signals arising from the protein. 
A ligand enriched with an NMR detectable isotope at an 
observed atom position can be used to enhance detection 
of a signal arising from the ligand atom. In addition, a 
signal can be selectively detected when an isotope filter 
10 or relaxation filter is used such as any of those 

described in Cavanaugh et al . , Protein N MR Spectroscopy: 
Principles and Practice , ch. 7, Academic Press, San Diego- 
CA (1996) . 

A protein can be isotopically labeled with 2 H 
15 atoms to simplify spectra by replacing NMR-visible X H 

atoms. For example, 2 H atoms can be incorporated at both 
exchangeable and non- exchangeable positions in a 
macromolecule by growing an organism expressing the 
macromolecule in the presence of D 2 0 ( 2 H 2 0) . 2 H atoms can 
20 be incorporated or maintained at exchangeable positions, 
such as at amides or hydroxyls of a protein, by carrying 
out steps in the isolation of the macromolecule in 
deuterated solvent. For protein labeling, acetate or 
glucose can be provided as the sole carbon source in the 
25 presence of D 2 0 if complete deuteration on carbon is 

desired. If pyruvate is used as the sole carbon source, 
there will be protons only on the methyl groups of Ala, 
Val, Leu and He (Kay, Biochem. Cell Biol. 75:1-15 
(1997) . 

30 When NOE methods are used to identify proximal 

ligands, the measurements can be performed at low 
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enable the observation of inter- ligand NOES at shorter 
mixing time. As temperature decreases, mixing time can 
be decreased resulting in ( x H r l E) NOESY spectra with 
5 increased sensitivity, thereby allowing observation of 
peaks that are not visible or that are difficult to 

distinguish at higher temperatures. Furthermore, N 
measurement of inter- ligand NOES at shorter mixing times 
and lower temperature also decreases spin-diffusion and 

10 protein mediated magnetization transfer which often have 
deleterious effects on the intensity of NOE signals. In 
general, higher sensitivity NOE measurements can be 
obtained at temperatures below 10° C. Thus, a method of 
the invention can include detecting magnetization 

15 transfer at temperatures below 8° C, below 5° C or below 
2° C, so long as the sample is in a liquid state. 

Further, two ligands that bind simultaneously 
to a macromolecule and in close proximity to each other 
can be identified in a method of the invention by 

20 detecting magnetization transfer between the two ligands 
when bound simultaneously with the macromolecule. For 
example, interactions between proximal ligands can be 
identified based on measuring cross-saturation between 
the ligands when bound in a complex with the 

25 macromolecule. A saturation transfer difference (STD) 
method can be applied in which selective excitation of a 
particular resonance of one ligand is followed by 
polarization transfer such that magnetization is 
effectively transferred, in a selective manner to a 

30 proximal ligand when bound in a complex. Under these 

conditions the resonances of non-bound molecules have an 
opposite sign and tend to be weaker than the resonances 
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be used to screen individual compounds or mixtures of 
potential ligand to identify those that bind to a target 
macromolecule . The cross saturation achieved via 
5 inter-ligand magnetization transfer can be achieved with 
natural abundance of NMR-visible isotopes. However, it 
can be advantageous to use deuterium labeled protein to 
effectively remove the effects of the proton mediated 
magnetization transfer from the protein. The STD method 

10 is particularly useful for use in a screening format 
because the NMR signals that are used for identifying 
proximal ligands can be collected on a relatively short 
time frame compared to other methods of determining 
ligand binding. In addition to providing a functional 

15 identification that the ligand binds to the protein, the 
STD method when used to identify proximal ligands 
provides structural information regarding the relative 
location of ligands when bound to the protein. 

Because the proximity of ligands is determined 
20 based on detection of interactions between ligands and 
does not require detection of interactions with the 
macromolecule to which they are bound, isotopically 
labeled macromolecules are not necessary. Thus, a 
macromolecule used in a method of the invention can 
25 contain a natural abundance of NMR-visible isotopes for 
the atoms it contains. Examples of NMR-visible isotopes 
are X H which is present in a natural abundance of 99.98%, 
13 C which is present in a natural abundance of 1.11% and 
15 N which is present in a natural abundance of 0.3 7%. A 
30 macromolecule can contain at most about 1% of the non- 
NMR-visible hydrogen isotope 2 H, at most 1.5% of 13 C or at 
most about 0.5% of 15 N. 
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Although labeled macromolecules are not 
required, a labeled macromolecule can be used in a method 
of the invention. For example, once proximal ligands are 
identified, the orientation of one or more ligands can be 
5 confirmed or further investigated by identifying NMR 
interactions with a labeled macromolecule. In 
applications where labeling of a macromolecule is desired 
in order to further investigate the orientation of one or 
more ligand when bound to the macromolecule or to 

10 investigate structural properties of the macromolecule 

binding site, strategies and methods known in the art for 
introducing one or more isotopic label can be used (see, 
for example, Laroche, et al . , Biotechnology 12:1119-1124 
(1994); LeMaster Methods Enzvmol , 177:23-43 (1989); 

15 Muchmore et al . , Methods Enzvmol. 177:44-73 (1989); 
Reilly and Fairbrother, J. Biomolecular NMR 4 : 459-462 
(1994); Ventors et al . , J. Biomol . NMR 5:339-344 (1995); 
and Yamazaki et al . , J. .Am. Chem. Soc. 116:11655-11666 
(1994) ) . 

20 A method of the invention is well suited for 

use with large macromolecules because proximal ligands in 
a complex with a macromolecule can be identified absent 
knowledge of the structure of the macromolecule or 
assignment of resonances for atoms of the macromolecule . 

25 In particular, large macromolecules having a monomeric 

molecular weight greater than 20 kDa, which often are not 
completely NMR assigned, or for which complete structure 
models are not available, can be characterized with 
respect to pairs of ligands that bind thereto. Because 

3 0 observation of magnetization transfer between ligands can 
be enhanced when the ligands experience low rotational 
mobility, macromolecules having monomeric molecular 
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weights greater than 25 kDa, 30 kDA, 40 kDa, 50 kDa, 75 
kDa, 100 kDa or 150 kDa can be used. Furthermore, a 
method of the invention can be used to identify proximal 
ligands for other macromolecules with low rotational 
5 mobility such as membrane bound proteins or multimeric 
proteins having at least 2, at least 3, or at least 4 
monomers, wherein the monomers can have monomeric 
molecular weights in the range described above. 

Because structural analysis of the 
10 macromolecule itself is not required to identify or 
characterize proximal ligands in a method of the 
invention, a macromolecule can be used for which 
resonance assignments have not been made for a majority 
of the atoms in the macromolecule. Thus, a method of the 
15 invention can use a macromolecule for which less than 
90%, 80%, 70%, 60%, 50%, 40%, 30%, 20% or 10% of the 
atoms have been assigned a resonance. 

Proximal ligands are identified in the methods 
under conditions where a majority of the macromolecule is- 

20 bound in a complex with two ligands. A condition in 
which a majority of the macromolecule is bound in a 
complex with two ligands can be achieved when the 
macromolecule is present at relatively low concentrations 
and excess ligand is present. Thus, although a method of 

25 the invention can be performed with millimolar 

concentrations of a macromolecule, as is often required 
for structure determination by NMR, lower concentrations 
such as concentrations below 200 micromolar can be used. 
The use of low concentrations of a macromolecule is 

30 advantageous when the target macromolecule is available 
in limited supplies or where screening procedures require 
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a large number of samples containing the macromolecule. 
In such cases, concentrations of the macromolecule below 
100 micromolar, 50 micromolar, 40 micromolar, 25 
micromolar, or 10 micromolar can be used. 

5 Ligands can be added to a macromolecule - 

containing sample in molar excess such that a majority of 
the macromolecule in the sample will be bound in a 
complex with the ligands. The extent of the molar excess 
can be determined using known methods for determining 

10 percent occupancy based on equilibrium binding equations, 
a known or predicted affinity constant of a ligand for a 
macromolecule and the concentration of the macromolecule 
in a sample (see, for example, Segel, supra) . 
Alternatively, excess ligands can be added and the amount 

15 sufficient to result in a majority of the macromolecule 
being bound to the ligands, can be determined 
empirically, for example, by titration. 

Proximal ligands are identified in the methods 
under conditions where the ligands bound in a complex 

20 with the macromolecule are inert to catalysis by the 
macromolecule. In cases where the macromolecule is a 
catalyst, a ligand mimic can be chosen that does not 
undergo catalysis or that undergoes catalysis at a rate 
that is slow compared to the timeframe in which ligand 

25 interactions are measured. In cases where a reactive 
ligand is used with an enzyme, conversion of the ligand 
to a product can be prevented by altering conditions such 
that catalytic activity of the enzyme is inhibited. For 
example, anaerobic conditions can be employed to inhibit . 

3 0 reactions requiring oxygen, pH can be adjusted to inhibit 
reactions requiring a particular protonation state of a 
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catalytic residue, or a noncompetitive inhibitor can be 
added. 



Once a pair of proximal protons from separate 
ligands is identified for a particular ternary complex, 
5 the distance between the ligands can be estimated. In 
particular, an atom of a first ligand that is proximal to 
an atom of a second ligand in a ternary complex can be 
identified. For example, the distance between the 
ligands can be estimated based on the distance separating 
10 the proximal protons as determined by measurements of NOE 
build-up rates using methods described in Cavanaugh et 
al . , supra (1996) . 



The distance determined to separate the 
proximal protons can then be used in combination with the 

15 average bond lengths separating other ligand atoms from 
the protons to estimate inter-ligand atomic distance in 
the ternary complex. For example, the distance between 
the atoms from each ligand that are directly bonded to 
the proximal protons can be estimated from the sum of the 

2 0 NOE measured distance and the theoretical lengths of both 
atom-proton bonds. Similarly, by summing the bond 
lengths separating other atoms from the proximal protons 
and considering bond angles, the distance separating 
these other atoms can be estimated. Even when distances 

25 are not measured, two ligands can be identified as 

proximal based on observation of magnetic interactions, 
when spin diffusion is absent or otherwise accounted for. 



Spin diffusion can be eliminated using QUIET 
NOESY (Quenching Undesirable Indirect External Trouble in 
30 NOESY, Neuhaus et al . "The Nuclear Overhauser Effect in 
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York, 2000) or NOE build-up curves. QUIET NOESY 
measurements can be performed to avoid artificial NOE 
cross-peaks arising from spin diffusion. These 
5 measurements differ from a conventional NOESY 

measurements by the presence in the middle of the mixing 
time of a selective (or a combination of selective) 180 
degree pulse (s) to invert only the signals of the two 
protons for which the length of separation is to be 

10 determined. NOE build-up curves can be used to plot NOE 
vs. mixing time such that signals due to direct NOE 
transfer can be differentiated from those that are 
indirect or due to spin diffusion based on the shapes of 
the curves as described, for example, in Cavanaugh et 

15 al. , supra (1996) . 

Those skilled in the art will understand that 
depending upon the degree of conformational freedom for 
the ligands in an observed ternary complex and the number 
of observed inter-ligand interactions, the estimation of 

20 distances between atoms that are increasingly removed 
from the proximal ligands can have different levels of 
precision. For example, in the case of two proximal 
aromatic ring structures for which two pairs of 
interactions are observed, the relative orientations of 

25 the rings can be estimated with relatively high precision 
and the distance separating any of the atoms in the two 
ring system can be determined with a relatively high 
level of confidence due to two-point anchoring between 
the planar rings. As set forth below, the estimated 

3 0 distance separating two ligands can be used to guide the 
selection of a linker to attach the proximal ligands in 
designing a bi-ligand binding compound. Depending upon 
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the level of confidence with which the distance is 
determined, the variety of linker types and ligand 
attachment points represented in a library of potential 
binding compounds can be adjusted, 

5 A ligand-probe having an attached antenna 

moiety can be used in a method of the invention. An 
antenna moiety provides an NMR-detectable nucleus that 
can occupy a position away from the ligand moiety of the 
ligand-probe such that a magnetic interaction between the 

10 nucleus and a proximal ligand can be used to identify the 
relative location of the proximal ligand even if it is 
too distant to magnetically interact with the ligand 
moiety of the ligand-probe. Thus, an antenna moiety can 
extend the range within which proximal ligands can be 

15 identified. 

Accordingly, the invention provides a method 
for obtaining a focused library of candidate binding 
compounds for a protein family, wherein the members of 
the protein family bind a common ligand. The method 

20 includes the steps of: (a) providing a ligand-probe 

having an antenna moiety, wherein the ligand-probe binds 
to the common ligand binding site of a protein, wherein 
the protein is a member of the protein family; (b) 
providing a sample containing the protein, the ligand- 

25 probe and a second ligand under conditions wherein the 
ligand-probe, the second ligand and the protein form a 
bound complex; (c) detecting magnetization transfer 
between the antenna moiety of the ligand-probe and the 
second ligand in the bound complex, thereby determining 

30 that the antenna moiety and second ligand are proximal in 
the bound complex; and (d) obtaining a population of 
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candidate binding compounds comprising the ligand-probe, 
or a fragment thereof, linked to one of a plurality of 
second ligand homologs, whereby the population contains 
binding compounds that bind to members of the protein 
5 family. 



Also provided is a method for obtaining a 
focused library of candidate binding compounds, wherein 
the members of the protein family bind a common ligand. 
The method includes the steps of: (a) providing a 

10 ligand-probe having an antenna moiety, wherein the 

ligand-probe binds to the common ligand binding site of a 
protein, wherein the protein is a member of the protein 
family; (b) providing a plurality of samples containing 
the protein and the ligand-probe under conditions wherein 

15 the ligand-probe and the protein form a bound complex, 
wherein the protein is a member of a family of proteins 
that bind a common ligand; (c) assaying a population of 
candidate second ligands for the ability to transfer 
magnetization to the antenna moiety of the ligand-probe 

20 in a sample from the plurality; (d) identifying, from the 
population of candidate second ligands, a second ligand - 
that transfers magnetization to the antenna moiety of the 
ligand-probe, thereby determining that the two ligands 
are proximal to each other in a ternary bound complex 

25 with the protein; and (e) obtaining a population of 

candidate binding compounds comprising the ligand-probe, 
or a fragment thereof, linked to one of a plurality of 
homologs of the other ligand, whereby the population of 
candidate binding compounds contains binding compounds 

3 0 that bind to members of the protein family. 
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Based on the length of an antenna moiety and 
its point of attachment to a ligand moiety, the relative 
location of a proximal ligand can be determined. 
Depending on where the antenna moiety is attached in the 
5 ligand-probe, the direction and approximate location of 
the other proximal ligand relative to the ligand moiety 
can be determined. Because the ligands are bound at a 
particular orientation in their respective binding sites, 
an antenna moiety can provide information regarding the 
10 relative structural relationships of proximal binding 
sites . 

An antenna moiety can have any of a variety of 
structures that extend from a ligand moiety including, 
for example, those described below with respect to 

15 linkers, so long as an NMR- observable nucleus is 

included. An antenna moiety can have a structure that is 
selected based on a particular distance desired for 
separating an NMR-observable nucleus and the ligand 
moiety to which it is attached or based on a particular 

20 orientation for the MVTR-observable nucleus relative to 
the ligand moiety. The relative distance and 
orientation can be determined based on visual inspection 
of a structure model for the protein to be tested or of a 
homolog of the protein. The distance and orientation can 

25 also be empirically determined, for example, by iteration 
of a method of the invention where the composition of the 
antenna moiety is altered until a, desired or diagnostic 
interaction is observed. . 

An antenna moiety can have a structure that is 
30 selected based on magnetic properties to be observed. 
For example, an antenna moiety can contain an NMR- 
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observable nucleus that is magnetically isolated from 
other atoms in the ligand probe to facilitate or improve 
a particular NMR measurement. As demonstrated in Example 
IV use of an ether linkage favored observation of direct 
5 NOE interactions between a terminal methyl and a proximal 
ligand at short mixing times compared to indirect NOE 
interactions due to magnetization transfer in the ligand 
probe. The ether linkage further provided an environment 
for the terminal methyl protons where relaxation effects 

10 due to vicinal proton coupling did not occur, thereby 
providing a stronger signal for the methyl protons. 
Isolation of an NMR-observable nucleus can also be 
achieved by providing an adjacent ether oxygen; carbonyl 
carbon; thioether, sulfone or sulfoxide sulfur; 

15 deuterated carbon; selenium; or nitrogen. An NMR- 
observable nucleus used in an antenna mioety can be at an 
internal position or at a terminal position. An example 
of an internal position that is useful is a proton in a 
phenyl or other aromatic ring structure that is 

20 deuterated at the other positions. 

An antenna moiety can also have an NMR- 
observable nucleus that is in an environment that differs 
from that of other nuclei in the ligand-probe such that 

25 the nucleus for which observation is desired can be 
selectively excited. As demonstrated in Example V, 
methyl protons at a position terminal to an aliphatic, 
ether-containing antenna moiety were selectively 
saturated compared to the aromatic protons of the 

30 attached ligand moiety. Those skilled in the art will 
understand that antenna moieties of different lengths, 
composition or point of attachment can be routinely 
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tested using a binding assay with the target 
macromolecule . 



A ligand-probe can contain a plurality of 
antenna moieties, such as 2, 3, 4, 5, or more antenna 
5 moieties attached to a ligand moiety, thereby forming a 
ligand multi-probe. The composition, length and point of 
attachment for each antenna moiety of a ligand multi- 
probe can be determined as described above. The antenna 
moieties included in a ligand multi-probe can be selected 

10 such that the nuclei of each antenna that is to be 

observed will resonate at a frequency that is readily 
distinguished from the other antenna nuclei that are to 
be observed. Thus, the nuclei of the antenna moieties 
can be separated in a single spectrum to facilitate 

15 identification of a proximal ligand and determination of 
its orientation relative to the ligand multi -probe. 

Comparison of signals arising from antennas 
attached at different points in a ligand multi -probe 
provide information regarding the direction and distance 

20 that separates it from one or more proximal ligands. 

Thus, the ligand multi-probe can be used to determine the 
direction and approximate location of proximal ligands 
bound to different sites on the protein. A ligand multi- 
probe can also be used to determine the orientation of 

25 the ligand moiety of the ligand multi-probe relative to 
the proximal ligand based on the points on the ligand 
moiety at which each antenna moiety is attached and the 
atoms of the proximal ligand that the antenna moiety 
interacts with. Thus, using the methods described below, 

30 a linker can be designed to connect the two ligands, or 
fragments thereof or homologs thereof, such that their 
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relative positions are in accordance with the observed 
orientations. It is understood that an antenna can be 
attached to any of a variety of ligands including, for 
example, a common ligand or specificity ligand. 

5 Once proximal ligands of a macromolecule are 

identified they can be used to design a binding compound 
or a library of candidate binding compounds for the 
macromolecule, A binding compound can contain a moiety 
formed by a first ligand, fragment of the first ligand or 

10 homolog of the first ligand, attached by a linker to any 
of a second ligand, fragment of the second ligand or 
homolog of the second ligand. A fragment of a ligand 
included in a linked binding compound can be any portion 
of the ligand that interacts with the target protein in 

15 such a way as to participate in specific binding. For 
example, a fragment of a ligand-probe that can be linked 
in a binding compound can be a ligand moiety, or fragment 
thereof . 



A portion of a ligand that interacts with a 
20 protein can be identified according to magnetic 

interactions of atoms of the particular portion of the 
ligand with atoms of the protein. Such interactions can 
be observed using methods known in the art such as those 
described in WO 99/60404 and U.S. Pat. No. 5,698,401. A 
25 portion of a ligand that interacts with a protein can 
also be identified by visual inspection of a structure 
model of a complex of the ligand and protein, such as an 
X-ray crystallographic or NMR structure; docking of a 
structure model of the ligand to a structure model of the 
30 protein; or comparison to other ligands that bind to the 
protein. 
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A library of candidate binding compounds can be 
obtained in which a moiety formed by a first ligand, 
fragment of the first ligand or homolog of the first 
ligand is linked to a variety of homologs of the other 
5 ligand. Where the library is directed to one or more 
protein in a common ligand binding family, diversity of 
the library occurs at the portion of the binding compound 
that will interact with the specificity ligand binding 
site (specificity portion) , thereby providing specificity 
10 for particular members of the family. The common ligand 
portion of the bi-ligand provides favorable interactions, 
thereby improving affinity of the binding compound for 
its target compared to the affinity that would be 
provided by the specificity portion alone. 



15 Diversity of the library can be further 

increased by using a variety of linkers or diverse 
combinations of homologs of both ligands. A homolog or 
population of homologs can be selected based on 
structural similarity to a particular ligand. A 

20 population of homologs can also be produced by a 

combinatorial approach in which a core structure of a 
ligand is modified or in which moieties found in a 
particular ligand are combined. 

A linker is selected based on the ability to 
25 provide sufficient length and conformational freedom for 
the ligands, or homologs thereof, to associate with their 
respective sites on the macromolecule . A linker can 
include any number of atoms that can attain a 
conformation resulting in the desired length between 
30 linked moieties including, for example, 1, 2, 3, 4, 5, 6, 
7, 8, 9, 10 or more atoms that are linearly connected. 
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Linear connection is used to describe the positions of 
the atoms relative to each other in a linker and is not 
intended to limit the linker to a linear structure. 
Accordingly, a linker can have atoms that form branch 
5 structures off of linearly connected atoms or a linker 
can be formed by one or more cyclic structure. 

A linker can be directly attached to a ligand 
at one of the atoms in the proximal pair. The linker can 
also be attached to a ligand, or homologs thereof, at the 
10 position of one of the atoms in the proximal pair whether 
or not the same atom occupies the position in the 
original ligand and in the linked compound. The linker 
is designed to have at least two positions for attaching 
at least two ligands, or homologs thereof. 

15 The point of attachment for a linker on each 

ligand, or homologs thereof, can be chosen to result in a 
binding compound having the ligands, or homologs thereof, 
separated by a distance similar to that observed for the 
ligands in the macromolecule complex. The distance 

20 between the two linked portions of a binding compound can 
be determined based on the positions of the proximal 
atoms in each portion. The position of one or both atoms 
can be occupied by another atom that is, for example, 
present due to the chemistry selected for attachment. 

25 The distance between the portions can also be determined 
based on the positions of other atoms where the relative 
positions in the bound complex are known. 



Those skilled in the art will understand that 
linkers of different lengths, composition or points of 
30 attachment can be routinely tested using a binding assay 
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with the target macromolecule. The number of variations 
to be tested can be determined, for example, based on the 
degree of confidence in the distance estimate for the two 
ligands to be joined. Variations can be individually 
5 tested in a binding assay with the target macromolecule 
or a library of variants can be screened for the ability 
to bind to the target macromolecule. Thus, a method of 
the invention can be carried out in an iterative fashion 
wherein the steps of the method are repeated with linkers 
10 of different lengths or compositions until a binding 

compound having a desired linkage is obtained. Similar 
iterations can be performed with different linked 
moieties until a binding compound having a desired 
affinity or specificity or both is identified. 

15 In another embodiment a common ligand is linked 

to each of a plurality of homologs or a proximal ligand 
to create a focused library of candidate binding 
compounds. The use of a natural common ligand, or mimic 
thereof, as a partner in a linked bi -ligand can be 

20 advantageous because natural common ligands can be more 
effective in crossing biological membranes such as 
bacterial or eukaryotic cell membranes. For example, a 
transport system actively transports the nicotinamide 
mononucleotide half of the NAD molecule (Zhu et al., J. 

25 Bacteriol. 173 ; 1311-1320 (1991)). Therefore, it is 

possible that a bi-ligand comprising a common ligand, or 
derivative thereof, that is actively transported into a 
cell will facilitate the transport of the bi-ligand 
across the membrane. 

30 Linkers that are useful for generating a 

binding compound include, for example, substituted 
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phosgene, urea, furane and salicylic acid. However, any 
chemical group with two reactive sites that can be used 
to position a first ligand and a second ligand in an 
optimized position for binding to their respective sites 
5 can be used as a 1 inker . 

Another group of linkers includes molecules 
containing phosphorous. These phosphorus -containing 
molecules include, for example, substituted phosphate 
esters, phosphonates, phosphorami dates and 
10 phosphorothioates . The chemistry of substitution of 
phosphates is well known to those skilled in the art 
(Emsley and Hall, The Chemistry of Phosphorous: 
Environmen tal, Organic, Inorganic and Spectroscopic 
Aspects, Harper & Row, New York (1976); Buchwald et al . , 
!5 Methods Enzvmol . 87:279-301 (1982); Frey et al . , Methods 
Enzymol . 87:213-235 (1982); Khan and Kirby, J. Chem. Soc. 
B: 1172-1182 (1970)) . A related category of linkers 
includes phosphinic acids, phosphonami dates and 
phosphonates, which can function as transition state 
20 analogs for cleavage of peptide bonds and esters as 

described previously (Alexander et al., J. Am. Chem. Soc, 
112:933-937 (1990)). The phosphorous -containing 
molecules useful as linkers can have various oxidation 
states, both higher and lower, which have been well 
25 characterized by NMR spectroscopy (Mark et al . , Progress 
in NMR Spectroscopy 16:227-489 (1983)) . 

The reactive groups on a linker and the 
ligands, or homologs thereof, to be attached should be 
reactive with each other to generate a covalent 
30 attachment of the ligands, or homologs thereof, to the 
linker at a sufficient distance for binding to their 
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respective binding sites on the macromolecule . A 
preferred reaction is that of a nucleophile reacting with 
an electrophile. Many of the above described linkers 
have electrophilic groups available for attaching 
5 ligands. Electrophilic groups useful for attaching 

ligands include electrophiles such as carbonyls, alkenes, 
activated esters, acids and alkyl and aryl halides. 

The linkers having electrophilic groups are 
preferably attached to ligands, or homologs thereof, 

10 having nucleophilic groups including, for example, 

alcohols, amines, or mercaptans. However, if a ligand, 
or homolog thereof, is identified that does not have 
appropriate reactive groups for attaching a linker, it 
can be modified to incorporate a reactive group at or 

15 near the position of an atom that was identified as one 
of the proximal atoms. If the ligand, or homolog 
thereof, cannot be modified to generate an appropriate 
reactive group in a desired position, an additional 
screen can be performed, as described above, to identify 

20 a homolog having desired binding characteristics as well 
as a chemical group in the proper position for attachment 
of a linker. 

A compound that binds a protein can be obtained 
/by screening a library of binding compounds for the 
25 ability to bind to a target macromolecule and identifying 
a member of the library that binds to the protein. The 
screen can be performed using the methods described above 
for determining binding of a ligand to a macromolecule. 
The compound can have specificity for a first protein 
30 over a second protein. For example, a compound can have 
specificity for a first protein that binds a common 
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ligand compared to a second ligand that binds the same 
common ligand. A binding compound obtained by a method 
of the invention can have specificity for one or more 
protein of a common-ligand binding family compared to a 
5 non- family protein. Such specificity can be due to more 
favorable interactions of the specificity portion of a 
compound with the first protein compared to its 
interactions with the second protein. Specificity can be 
characterized as at least about 2 fold higher affinity, 

10 at least about 3 fold higher affinity, at least about 4 
fold higher affinity, at least about 5 fold higher 
affinity, at least about 10 fold higher affinity, at 
least about 25 fold higher affinity, at least about 50 
fold higher affinity, at least about 100 fold higher 

15 affinity or at least about 1000 fold higher affinity. 

A binding compound obtained by a method of the 
invention, by combining moieties from two ligands that 
bind proximal to each other in a complex with a protein, 
will have higher affinity or specificity for the protein 
20 than the affinity or specificity of either ligand alone. 
The affinity of a compound, obtained by a methods of the 
invention, for a protein can have an equilibrium 
dissociation constant of at most about 10" 6 M, 10" B M, 10' 9 
M, 10' 10 M, lO" 11 M or 10' 12 M. 

25 Although the methods of the invention have been 

described above with respect to a complex in which two 
ligands bind a macromolecule and in which a bivalent 
binding compound is designed, a method of the invention 
can also be used to identify 3 or more ligands that are 

30 proximal when bound to a macromolecule and to link the 
ligands using 2 or more linkers in order to form a multi- 
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ligand binding compound. A method of the invention can 
also be used to design a mult i -ligand binding compound by 
sequentially adding ligands to a binding compound. Thus, 
a method of the invention can include the steps of (a) 
5 obtaining a sample containing a macromolecule, a first 
ligand and a second ligand under conditions wherein a 
bound complex is formed containing the first ligand, the 
second ligand and the macromolecule; (b) detecting 
magnetization transfer between the first ligand and the 

10 second ligand in the bound complex; (c) determining from 
the magnetization transfer the distance between an atom 
of the first ligand and an atom. of the second ligand in 
the bound complex; (d) obtaining a binding compound 
including the first ligand, or a fragment thereof, the 

15 second ligand, or a fragment thereof, and a linker, . 
whereby the binding compound is capable of binding the 
macromolecule; and (e) repeating steps (b) through (d) , 
wherein the first ligand is replaced by the binding 
compound obtained in step (d) and the second ligand is 

20 replaced by another ligand. 

Once a binding compound has been obtained its 
specificity for a particular member of a macromolecule 
family can be determined by determining the affinity of 
the compound for the macromolecule compared to other 

25 macromolecules in the family. If the compound binds to a 
first macromolecule with higher affinity or at a faster 
rate than a second macromolecule it will be identified as 
being specific for the first macromolecule. Although 
absence of binding to the second molecule in such an 

30 assay is preferable in many situations, any increase in 
specificity for the first macromolecule over the second 
can be exploited in applications were specificity is 
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desired. Furthermore, determination that a compound has 
specificity for one macromolecule over another, even if 
moderate, can identify the compound as a candidate for 
iterative improvement in a method of the invention. In 
5 particular, the compound so identified can be bound in a 
complex with the target macromolecule and used to 
identify a ligand that binds in a proximal location. 
Covalent linkage of the compound and proximal ligand can 
yield a subsequent binding compound with higher affinity 
10 and improved specificity for the macromolecule compared 
to other macromolecules in the same family. 

The invention further provides a method for 
identifying a compound having specificity for a 
particular member of a protein family, compared to other 

15 members of the protein family, prior to synthesizing the 
compound. The method includes the steps of: (a) 
observing competitive binding of the common ligand and a 
first ligand to a first protein, (b) observing 
competitive binding of the common ligand and a first 

20 ligand to a second protein, wherein the first and second 
proteins are members of the protein family, thereby 
determining that the first ligand binds to the common 
ligand binding site of the first and second proteins; (c) 
providing a sample containing the first protein, the 

25 first ligand and a second ligand; (d) providing a sample 
containing the second protein, the first ligand and the 
second ligand; (e) comparing the degree of magnetization 
transfer between the first ligand and the second ligand 
for the samples of parts (b) and (c) ; and (f ) obtaining a 

30 binding compound including the first ligand, or a 
fragment thereof, linked to the second ligand, or a 
fragment thereof, whereby the binding compound 
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selectively binds the first protein compared to the 
second protein. 

Two or more ligands to be linked, or for which 
homologs can be identified, in order to produce a binding 
5 compound with specificity for a first macromolecule over 
a second macromolecule can be identified by comparison of 
magnetization transfer between the ligands when bound to 
the different macromolecules . Absence of magnetization 
transfer between ligands in the presence of the second 

10 macromolecule will indicate that at least one of the 
ligands does not bind the second macromolecule or that, 
if the ligands both bind they are relatively distal from 
each other. Thus, a binding compound in which the 
ligands, or homologs thereof, are linked according to 

15 distances observed in the first macromolecule will have 
reduced affinity for the second macromolecule either 
because one of the ligands, or homologs thereof, does not 
contribute to a favorable binding interaction or because 
the ligands, or homologs thereof, are sterically 

20 constrained from binding to both sites on the second 

macromolecule. Similarly, a library of candidate binding 
compounds will have a higher probability of containing a 
compound that is specific for the first macromolecule. 

A greater degree of magnetization transfer 
25 between two ligands when bound to a first macromolecule 
compared to when bound to a second macromolecule can 
indicate a shorter distance between the ligands in the 
first macrolecule. Based on the distance measured 
between the ligands in both macromolecule complexes the 
30 length of a linker can be chosen to favor binding to the 
first macromolecule by being long enough to allow the two 
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ligands, or homologs thereof, to bind to both sites on 
the first macromolecule but too short to allow both 
ligands, or homologs thereof, to bind their respective 
sites on the second macromolecule. 

5 The following examples are intended to 

illustrate but not limit the present invention. 

EXAMPLE I 

Design of a Potent and Specific Bi-Ligand for p38Cf MAP 

Kinase 

10 This Example demonstrates use of the methods of 

the invention to design a potent and selective inhibitor 
for activated p38a MAP kinase, starting from relatively 
weak binding fragments. 

A library of 29 PBBA structure analogs was 
15 screened against unlabeled p38a MAP kinase (p38a)as 
follows. Samples containing 10 to 30 micromolar 
concentration of unactivated p38a MAP kinase and 0.1 to 
1.0 millimolar concentration of one of the PBBA structure 
analogs were obtained. The samples were screened using 
20 WaterLOGSY with saturations times of 2s (frequency 

selective excitation via a train of 232 6ms n pulses with 
a GAUSS profile at an 80Hz RF field strength) with 
solvent suspension using WATERGATE at 4°C. WaterLOGSY is 
further described in Dalvit et al . J. Biomol. NMR 
25 21:349-59 (2001) Dalvit et al. J. Magn. Res. B112:282-288 
(1996) and Dalvit et al . , J. Biomol. NMR 11:437-444 
(1998) . Among the compounds screened, p-butyl benzoic 
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acid (PBBA) showed cross -saturation effects with p3 8a 
indicating binding to the protein. 



Potential inhibitors designed to mimic the 
natural cof actor, as determined by visual inspection of 
5 commercially available compounds and assessment of the 
scientific literature pertaining to kinase medicinal 
chemistry, were screened against a complex of p38a and 
PBBA as follows. Samples containing 10 to 30 micromolar 
concentration of unactivated p38a MAP kinase and 0.1 to 

10 1.0 millimolar concentration of PBBA and 0.1 to 1.0 
millimolar concentration of one of the potential 
inhibitors were obtained. For each sample a 2D ^H/H] 
NOESY experiment was performed at 4°C. This process 
identified 7 molecules that bound proximal to the 

15 terminal methyl of PBBA. Figure 2 shows the structure of 
PBBA, with the terminal methyl represented by an asterisk 
and structures of the 7 molecules where arrows indicate 
the regions of each molecule that contained atoms having 
NOE interactions with the terminal methyl of PBBA. 

20 Figure 3 shows exemplary NMR NOESY data for the 

ternary complex formed by p38a, PBBA and the 
TTM2001.082.B09 inhibitor molecule shown in the first 
panel of Figure 2a. As shown in Figure 3, NOE crosspeaks 
were observed for atoms of the aliphatic moiety of PBBA 

25 with atoms of the f luoro-phenyl moiety of the inhibitor 
TTM2001.082 .B09. The crosspeaks indicated that the atoms 
of the aliphatic moiety of PBBA, identified as atoms 1 to 
4 in the right hand panel of Figure 3, were proximal to 
the atoms of the f luoro-phenyl moiety of the inhibitor 

30 TTM2001.082.B09 that are labeled as atoms a and b in the 
right hand panel of Figure 3 . 
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NOE buildup experiments were performed on the 
sample containing p38a MAP kinase, PBBA and the inhibitor 
TTM2001.082.B09 and used to determine the distance 
between atoms 1 to 4 of PBBA and atoms a and b of the 
5 inhibitor TTM2 001 . 082 .B09 . Based on the NOE buildup 

experiments, the distance between the terminal methyl of 
PBBA and atom a of the inhibitor TTM2001 . 082 .B09 was 
determined to be 3 ± 0.5 A. 

The TTM2001.101.A09 bi-ligand compound (shown 

10 in Figure 4) was designed based on the NOE determined 
distances to contain a moiety similar to the PBBA 
molecule and a moiety similar to the inhibitor 
TTM2001.082.B09 joined by a thioether ( -CH 2 -CH 2 -S-CH 2 -) 
linker. The TTM2001 . 101 .AO 9 bi-ligand compound was 

15 synthesized as follows. 4- (6- (Acetylsulf anyl) hexyl) 

benzoic acid methyl ester (0.985 mmol) was deprotected, 
removing the thioacetate group, in a biphasic mixture 
containing potassium carbonate (4.34 mmol) in 
nitrogen-purged methanol (4 ml), water (2 ml) and 

20 tetrahydrofuran (2 ml) that was stirred at room 

temperature for 1.5 hrs under nitrogen. Esterified 
TTM2001.101.A09 (1- (4-Fluoro-3- (6- ( (4-methoxycarbonyl- 
phenyl ) hexyl sul f anyl ) methyl ) phenyl ) - lH-benzoimidazole - 5 - c 
arboxylic acid) was synthesized by then adding 

25 1- (3-Cloromethyl-4-f luor-phenyl) -lH-benzimidazole-5-carbo 
xylic acid (0.820 mmol) to the stirred deprotection 
mixture. Following work up by removing volatile solvent 
in vacuo, diluting with water, acidifying with 2N HC1 to 
pH < 1, diluting with brine and extracting with ethyl 

30 acetate, the esterified TTM2001 . 101 .A09 product was 
purified with flash chromatography (gradient 95:5 
dichloromethane/ methanol to 90:10 dichloromethane/ 
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methanol) . The Ester was removed from the purified 
product by stirring at room temperature for 15 hours with 
lithium hydroxide (1.25 mmol) in methanol (1 ml) and 
water (1 ml), followed by addition of another 1.25 mmol 
5 of lithium hydroxide hydroxide and stirring at room 
temperature for another 24 hours. TTM2001 . 101 .A09 was 
purified from the mixture by acidification with 2N HCl 
until pH < 2 and collection of the white precipitate by 
filtration and washing with water and ether. 

10 The ability of TTM2001 . 101 .A09 to bind to p38a 

was determined by comparing the degree of line broadening 
in ID X H NMR spectra for the compound in the presence of 
p38a compared to in the absence of p38a. As shown in 
Figure 4, significant line -broadening was observed in the 

15 ID l H NMR spectrum of 50 micromolar of TTM2001 . 101 .A09 in 
the presence of 10 \M of p38a (Figure 4b) compared to the 
spectrum obtained for 50 micromolar of TTM2001 . 101 .A09 in 
the absence of p38a (Figure 4a) , indicative of tight 
binding between p38a and TTM2001 . 101 . AO 9 . 

20 The TTM2001 . 101 .AO 9 bi-ligand compound and the 

fragments from which it was constructed were tested as 
inhibitors of p38a enzymatic activity in an assay 
measuring phosphorylation of myelin basic protein (MBP) 
by p38a. The results of the assay are shown in Figure 5, 

25 and indicated that TTM2001 . 101 .A09 bound to p38a with an 
IC 50 of 1.7 micromolar, which was greater than 100 fold 
tighter than either of the starting fragments. 
Correction of the IC 50 of TTM2001 . 101 .A09 for high ATP 
concentration indicated that the Ka for binding between 

30 p38a and TTM2001 .101 .A09 was about 300 nanomolar. 
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Example II 

Combinatorial Matching of Fragments with the MOTIF 

Library 



This example demonstrates the creation of a 
5 library of molecules, termed a MOTIF library, having 
sub- structural features or moieties that are commonly 
found in marketed drugs or other compounds that have been 
evaluated in a clinical setting. This example further 
describes the use of NMR ACE to screen a MOTIF library to 
10 obtain a bi-ligand that specifically binds a protein. 

A number of PBBA related compounds were 
screened with WaterLOGSY to identify those that bound to 
p38a. WaterLOGSY screening identified the ligands shown 
in Figure 6. Of the compounds shown in Figure 6, two were 
15 identified by cross-saturation experiments (with 10 pM 
unactivated p38a, 1.5 s to 3.0 s saturation with a train 
of IBURP pulses) to interact with TTM2001 . 082 . A10 as 
indicated by the arrows. 

A library, referred to as the MOTIF library, 
20 containing molecules having sub- structural features or 
moieties that are commonly found in marketed drugs, as 
well as other compounds that have been evaluated in a 
clinical setting was constructed as follows. A database 
of compounds that are either marketed as drugs or that 
25 have undergone clinical trials was created. The database 
contained over 3500 different chemical entities. The 
prevalence of particular fragments in the database was 
analyzed. For example, the diphenyl amine moiety was 
found in 96 out of 3882 compounds, or 2.5% of the 
30 compounds. Moreover, this moiety was found in multiple 
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pharmacological classes. Thus, diphenylamine was 
identified as a small molecular weight molecule that can 
be used as part of a screening library for NMR ACE. 

Small molecules that contain the diphenyl amine 
5 moiety were included in the MOTIF library. Other 

criteria for determining whether a molecule was to be 
included in the MOTIF library included a maximum 
molecular weight of 22 0 Daltons and chemical inertness 
under the assay conditions. Based on these criteria a 
10 diverse MOTIF library of 160 small drug- like compound 
fragments was arrayed in multiwell plates for screening. 

The MOTIF library members were characterized in 
terms of solubility and ID and 2D COSY NMR spectroscopy 
to obtain proton assignments. The members of the MOTIF 

15 library that were found to have favorable characteristics 
such as solubility and resolved assignable protons were 
then screened against samples containing a complex of 
p38a and one of the 6 PBBA- related compounds shown in 
Figure 6. The screen was repeated such that each member 

20 of the MOTIF library, that was found to have favorable 
characteristics, was screened against six different 
- samples containing p38a and, respectively, each of the 
PBBA- related compounds shown in Figure 6. Samples found 
to contain ternary complexes were further analyzed by NOE 

25 buildup experiments to determine distances between pairs 
of MOTIF library members and PBBA-related compounds in 
the ternary complexes. One pair of ligands that bound 
proximal to each other with p38a was TTM2001 . 082 .A10 and 
TTE2001.084 .47A. 
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Bi-ligand binding compounds are synthesized to 
have covalently attached moieties based on the chemical 
identities of and distances between the pairs of MOTIF 
library members and PBBA-related compounds that are found 
5 to be proximal in the ternary complexes. 

Example III 
Gene Family Focused Libraries with NMR ACE 

This example demonstrates the use of 
competition experiments and structure analysis combined 
10 with NMR ACE to design focused libraries targeted to a 
particular protein or family of proteins. 

Using the methods described in Example I, PBBA 
and SB203580 were found to bind to p38a to form a ternary 
complex where PBBA was proximal to SB203580. As shown in 

15 the upper panel Figure 7, NOE interactions were observed 
between atom 4 of PBBA and atoms a and b of SB203580. 
Based on NOE buildup experiments the distance between 
atom 4 of PBBA and atom b of SB203580 was determined to 
be 3 i 0.5 A. Thus, PBBA bound to p38a at a site that 

20 was proximal to the ATP common ligand binding site. 

Specificity pocket 

The location where PBBA binds to p38a was 
predicted as follows. SB203580 is a known inhibitor of 
p38a that shows competitive binding with ATP in enzymatic 
25 assays, indicating that it binds in the ATP site. 

Binding of SB203580 to the ATP site of p38a has also been 
observed with a crystal structure of the SB203580/p38a 
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complex shown in the lower panel of Figure 7 (see Wang et 
al. Structure 6:1117-1128 (1998) and Protein Data Bank 
entry lBL6.pdb). Cross-saturation competition studies 
indicated that SB203580 was not able to displace PBBA. 
5 Furthermore, PBBA was not able to displace a 

f luorescently tagged staurosporine in a fluorescence 
polarization experiment. Staurosporine is known to bind 
in the ATP site of p38a. Thus, both NMR and traditional 
displacement experiments indicated that PBBA did not bind 
10 in the ATP site. 



A peptide having the sequence I PTTPI TTTYFFFKKK 
(SEQ ID N0:1) is a known phosphorylation substrate for 
p38a as described, for example, in Chen et al. 
Biochemistry 39:2079-2087 (2000). This peptide could not 

15 displace PBBA from p38a in WaterLOGSY competition 
- experiments, indicating that the peptide and PBBA 

occupied different binding sites on p38a. However, PBBA 
was shown to inhibit phosphorylation of MBP protein by 
p38a as described in Example I and shown in Figure 5. 

20 Since the Mbp binding site includes, but extends beyond 
the peptide binding site, these various competition 
experiments suggest that PBBA binds in a part of the MBP 
substrate binding site that extends beyond the peptide 
binding site. This site is shown in the lower panel of 

25 Figure 7, and is referred to as the SL (specificity 

ligand) site, since extending a bi-ligand library into 
this site off of an ATP mimic might provide additional 
specificity. 

Based on the distances determined between PBBA 
30 and SB203580, the location where SB203580 binds to p38a, 
and the results of the competitive binding assays 
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described above, the relative locations of the binding 
regions of p38a for ATP, MBP, the peptide of SEQ ID NO:l 
and PBBA were determined. A schematic diagram showing 
the relative locations of these binding regions is 
5 provided in Figure 8, As shown in Figure 8, the ATP 
binding region (dark shaded region) is adjacent to the 
MBP binding region (white region) and within the MBP 
region is a region where PBBA binds (indicated by 
brackets) as well as a region where the peptide (SEQ ID 
10 NO:l, lightly shaded region) binds. As shown in Figure 
8, the region of p38a where PBBA binds is separate from 
the region of p38of where phosphorylation occurs. 



The locations of the binding regions of p38a 
for - ATP, Mbp, the peptide of SEQ ID NO:l and PBBA were 

15 further defined based on a structural comparison of 

p38a-like proteins as follows. Figure 9A shows a portion 
of the model of the p3 8a structure from Wang et al. supra 
(1998) (Protein Data Bank entry lBL6.pdb) which includes 
the regions diagramed in Figure 8 and where residues are 

20 color coded based on the degree of conservation between 
the residues of the p38a-like proteins. The degree of 
conservation was determined using PrISM (Yang and Honig 
Proteins 37:66-72 (1999)) and Psi-blast (Altschul et al . , 
Nucleic Acids Res. 25:3389-3402 (1997)). Residue 

25 conservation scores were obtained from the multiple 
structure and sequence alignments, which range from 
highly homologous (blue) to distantly related (red) 
residues. As shown in Figure 9, the region of p38a that 
binds PBBA is variable. Because the region where PBBA 

3 0 binds is variable it is termed a specificity ligand site 
of p38a. The ATP binding site has a conserved structure 
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and is referred to as a common ligand site of the 
p38a-like proteins. 



The peptide (SEQ ID NO:l) was modeled into the 
p3 8a structure based on its location in the PKCa crystal 
5 structures described in Nishikawa et al. J. Biol. Chem, 
272:952-960 (1997); Nair et al . J. Med. Chem, 
38:4276-4283 (1995); Songyang et al . Cur. Biol. 
4:973-982 (1994); Songyang et al. Mol . and Cell. Biol, 
16:6486-6493 (1996). Incorporating residue conservation 
10 scores as the starting point, computational docking 
simulations were performed with small molecules using 
GOLD and other known methods as described, for example, 
in Doucet and Weber, « Computer -Aided Drug Design" 
Academic Press (1996) . The simulations, showed that 
15 molecules, such as PBBA, docked into the specificity 
region indicated by the circle in Figure 9B. The 
location of the SB2 03580 ATP mimic is represented with a 
pentagon in Figure 9 and the relationship between PBBA 
and SB203580 is indicated with arrows. The arrows span 
20 the cone of area that can be occupied by the benzoic acid 
moiety of PBBA, when constrained to have the terminal 
methyl proximal to SB203580 according to the NOE of 
Figure 7. The electrostatic surface potential map of 
p38a was calculated using the Grasp algorithm (the Grasp 
25 algorithm is described, for example, in Micholl et al. 
Proteins : Strut . Func . and Genet . , 11:281-296, (1991)). 

Design of a focused library 



Based on the results described above, the 
region of p38a that binds PBBA is predicted to be a 
30 target for binding compounds having specificity for a 
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particular member of this gene sub-family. The proximity 
of this specificity ligand site to the relatively 
conserved ATP site indicates that a bi-ligand library can 
be constructed in which a common ATP or ATP-like moiety 
5 is linked to one of a variety of moieties that are 

similar to PBBA or that bind to the same site as PBBA. A 
moiety that binds to the same site as PBBA is determined 
by docking a model of the mioety to the PBBA binding 
site, by structural comparison to PBBA or by identifying 
10 ligands that bind to p38a in an in vitro binding assay. 

In order to create a focused library that is 
specific to p38a and related protein kinases, moieties 
can be chosen based on specif icity for the PBBA binding 
site of p38a compared to other p38a-like proteins. 

•5 Example IV 

Identifying Ligand Location with a Ligand-Probe 
Containing an Antenna Moiety 

This example demonstrates the use of a common 
ligand-probe having. an antenna moiety capable of 
0 detecting a proximal second ligand. This example further 
demonstrates discrimination of the relative position and 
orientation of second ligands using a common ligand-probe 
having an antenna moiety. 

Ligand-probe TTM2002 . 143 . A27 contains an ATP 
5 mimic core moiety covalently attached via an amine 

linkage to a 3-oxabutyl antenna moiety as shown in Figure 
10. The TTM2002.143.A27 ligand-probe was designed based 
on the binding orientation and position of the parental 
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protein kinase. The antenna moiety was placed such that 
it can extend from the core structure toward a proximal 
binding site. 

5 The ether linkage in the antenna moiety allows 

the terminal methyl group to be relatively isolated from 
the other protons in the ligand probe, thereby favoring 
observation of direct NOE transfer from the methyl to a 
proximal ligand. The ether linkage allows greater 

10 differentiation of direct NOE interactions between the 
methyl and a proximal ligand compared to indirect NOE 
interactions from a proximal ligand through the core 
moiety of the ligand probe to the methyl. Isolation of 
the methyl group due to distance from the other protons 

15 allows direct NOE transfer to be selectively observed by 
obtaining spectra at relatively short mixing times. 
Furthermore, the absence of vicinal protons minimizes 
relaxation effects for the methyl protons, thereby 
providing a stronger signal. 

20 From the MOTIF library, described in Example 

II, 25 compounds were identified that bound to the p38ct 
protein kinase at sites different from the common ligand. 
These 25 compounds were screened for proximal binding 
near the core of the parental common ligand as follows. 

25 Samples were obtained containing 100 to 1000 micromolar 
concentrations of the TTM2002 . 143 . A27 ligand-probe, 10 to 
50 micromolar concentrations of activated p3 8oc protein 
kinase, and one of the 25 compounds at a concentration of 
100 to 1000 micromolar. The samples were screened for 
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proximal ligand interactions by ^H, X H) 2D NOESY 
acquisitions using mixing times of 100-1200 msec at 4°C. 

Among the 25 compounds screened, 
p-chloro-phenol (PCP, TTE0020 . 003 . AOS) exhibited NOEs to 
5 protons located within the antenna moiety of the 

TTM2002.143.A27 ligand-probe . As summarized in Figure 
10, for the p38a-TTM2002 . 143 . A27-PCP ternary complex, 
NOEs were identified between the aromatic hydrogen 
protons of PCP and the aliphatic protons in the antenna 

10 of TTM2002.143.A27. No NOE crosspeaks of significant 
intensity were observed from PCP to aromatic protons in 
the core of the parental common ligand in NOESY spectra. 
Thus, the binding site of PCP appeared to be restricted 
to a location on the surface of p38ot that was within 6.0 

15 A of the antenna moiety, but at a distance greater than 
6 . 0 A from the aryl rings in the core moiety of 
TTM2002.143.A27. Inter-ligand distance between the PCP 
ligand and antenna moiety are determined based on 
inspection of the build-up of intensity in NOESY 
20 interactions as a function of mixing time (tJ . 

Also among the 25 compounds screened, the 
TTE0020.003.A09 ligand was found to bind proximal to the 
TTM2002.143.A27 ligand-probe . As summarized in Figure 
11, NOE cross peaks were observed between aliphatic 

25 protons of the TTE0020 . 003 . A09 second ligand and the core 
moiety of the ligand probe. However, no significant 
inter-ligand NOE cross-peaks were observed to the 
antenna -probe. Thus, the binding site of TTE0020 . 003 . A09 
appeared to be at a location of p38oc that was within 6.0 

3 0 A of the aryl rings in the core moiety of 
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TTM2002.143 .A27, but at a distance greater than 6.0 A 
from the antenna moiety. 

Comparison of the NOEs observed between the 
ligand-probe and PCP with the NOEs observed between the 
5 ligand-probe and TTE0020 . 003 . A09, indicates that addition 
of the antenna provides information discriminating 
between the locations of the differing binding sites for 
second ligands. 



Example V 

10 Identification of Proximal Ligands by Selective Cross 

Saturation of an Antenna Moiety 

This example demonstrates identification of 
proximal ligands using selective cross saturation of 
protons of an antenna moiety attached to a ligand probe. 

15 Ligand-probe TTM2002 . 143 . A27 was obtained as 

described in Example IV. The protons of the terminal 
methyl group of the antenna moiety can be selectively 
saturated compared to other protons of the ligand probe 
because the methyl is isolated by the adjacent ether 

20 group and because the frequency of saturation for the 
methyl protons is different from that of the aromatic 
ring protons. 

The 25 compounds from the MOTIF library 
described in Example IV were screened for proximal 
25 binding near the ligand-probe as follows. Samples were 
obtained containing 100 to 1000 micromolar concentrations 
of the TTM2002 .143.A27 ligand-probe, 10 to 50 micromolar 
concentrations of activated p38a protein kinase, and one 
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of the 25 compounds at a concentration of 100 to 1000 
micromolar . Samples were screened for the presence of 
second ligands binding proximal to the common 
ligand-probe by ( X H) 1 dimensional saturation transfer 
5 difference experiments using saturations times of 2s 

(frequency selective excitation via a train of 232 6ms n 
pulses with a GAUSS profile at an 80Hz RF field strength) 
with solvent suspension using WATERGATE at 4°C. 

Among the compounds screened, the aromatic 
10 protons of p-pentyl -aniline (PPA) showed a reduction in 
the intensity on selective saturation of the methyl 
protons located within the terminus of the antenna-probe 
(2s saturation at 3.0 ppm) , relative to a control 
experiment (with of f -resonance saturation 5000Hz 
15 up- field) . As summarized in Figure 12 intensity changes 
were observed for two aromatic protons in a second ligand 
(TTE0020.003.A09) indicating that they were proximal to 
the antenna probe when bound to p38a. 



These results demonstrate that proximally bound 
20 ligands can be identified by observing reduced intensity 
of resonances for protons in a second ligand that bind 
close to an antenna moiety for which the proton 
resonances have been selectively saturated. Such 
one -dimensional NMR experiments can be performed in 
25 minutes, allowing a roughly ten-fold reduction in the 
screening time per compound compared to 2D NOESY based 
methods. Such a pre-selection approach can be applied 
prior to detailed characterization by 2D ( X H # X H) NOESY or 
as an alternative to 2D appraoches, to increase screening 
3 0 throughput and reduce instrumentation demands. 
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Throughout this application various 
publications, patents and patent applications have been 
referenced. The disclosures of these publications, 
patents and patent applications in their entireties are 
5 hereby incorporated by reference in this application in 
order to more fully describe the state of the art to 
which this invention pertains. 

Although the invention has been described with 
reference to the examples provided above, it should be 
10 understood that various modifications can be made without 
departing from the spirit of the invention. Accordingly, 
the invention is limited only by the claims. 
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What is claimed is: 

1. A method for obtaining a focused library of 
candidate binding compounds for a protein family, wherein 
the members of the protein family bind a common ligand, 
5 comprising the steps of: 

(a) observing competitive binding of the 
common ligand and a first ligand to a protein, wherein 
the protein is a member of the protein family, thereby 
determining that the first ligand binds to the common 

10 ligand binding site of the protein; 

(b) providing a sample comprising the protein, 
the first ligand and a second ligand under conditions 
wherein the first ligand, the second ligand and the 
protein form a bound complex; 

15 (c) detecting magnetization transfer between 

the first ligand and the second ligand in the bound 
complex, thereby determining that the two ligands are 
proximal in the bound complex; and 

(d) obtaining a population of candidate 

20 binding compounds comprising the first ligand, or a 

fragment thereof, linked to one of a plurality of second 
ligand homologs, 

whereby the population contains binding 
compounds that bind to members of the protein family. 

25 2. The method of claim 1, wherein the protein 

contains a natural abundance of isotopes for carbon or 
nitrogen. 



3. The method of claim 1, wherein the protein 
is present in the sample at a concentration less than 200 
3 0 micromolar. 
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4. The method of claim 1, wherein the protein 
has a monomeric molecular weight greater than 20 kDa. 

5. The method of claim 1, wherein the protein 
is deuterium labeled. 

5 6. The method of claim 1, wherein the common 

ligand is a natural common ligand. 

7. The method of claim 1, wherein the common 
ligand is a mimic of a natural common ligand. 

8. The method of claim 1, wherein the 

10 equilibrium dissociation constant for the first ligand 
and the protein absent the second ligand, but otherwise 
under the conditions, is greater than 1 x 10" 6 Molar. 

9. The method of claim 8, wherein the 
equilibrium dissociation constant for the second ligand 

15 and the macromolecule absent the first ligand, but 

otherwise under the conditions, is greater than 1 x 10" 6 
Molar. 

10. The method of claim 1, wherein the first 
ligand is identified by a method comprising measuring 

20 cross-saturation for a bound complex comprising the first 
ligand bound to the macromolecule. 

11. The method of claim 10, wherein the cross- 
saturation is measured using WaterLOGSY. 



WO 03/023392 



PCT/US02/28745 



4 



95 

12. The method of claim 1, wherein the first 
ligand is identified by obtaining a structure model for 
the binding site of the macromolecule and docking a 
structure model of the first ligand with the structure 

5 model of the binding site. 

13. The method of claim 1, further comprising 
determining that the second ligand binds to a different 
location on the protein from the common ligand. 

14. The method of claim 1, wherein step (c) 
10 further comprises measuring magnetization transfer at a 

temperature below 10° C. 

15. The method of claim 14, wherein measuring 
magnetization transfer comprises obtaining a 2D ( Z H, ^H) 
NOESY spectrum. 

15 16. The method of claim 1, wherein step (c) 

further comprises identifying an atom of the first ligand 
that is proximal to an atom of the second ligand. 

17. The method of claim 16, further comprising 
determining the distance between the atom of the first 

20 ligand that is proximal to the atom of the second ligand. 

18. The method of claim 17, wherein the linker 
connects the first ligand, or fragment thereof, and the 
second ligand homolog such that the first ligand, or 
fragment thereof can be separated from the second ligand 

25 homolog by the distance separating them in the bound 
complex. 



WO 03/023392 PCTYUS02/28745 



96 

19. The method of claim 16, wherein the linker 
connects the first ligand and the second ligand homolog 
at the positions of the proximal atoms of the first and 
second ligands. 

5 20. The method of claim 1, further comprising 

repeating steps (b) through (d) , wherein the first ligand 
is replaced by the binding compound obtained in step (d) 
and the second ligand is replaced by another ligand. 

21. A method of identifying a compound that 
10 binds a protein, comprising the steps of: 

(a) obtaining a focused library of candidate 
binding compounds for a protein family according to claim 
1; 

(b) assaying the focused library for the 

15 ability to bind to a protein, wherein the protein is a 
member of the protein family; and 

(c) identifying a member of the focused library 
that binds to the protein. 
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22. A method for obtaining a focused library 
of candidate binding compounds, wherein the members of 
the protein family bind a common ligand, comprising the 
steps of: 

5 (a) providing a plurality of samples 

comprising the protein and a first ligand under 
conditions wherein the first ligand and the protein form 
a bound complex, wherein the protein is a member of a 
family of proteins that bind a common ligand; 
!0 (b) assaying a population of candidate second 

ligands for the ability to transfer magnetization to the 
first ligand in a sample from the plurality; 

(c) identifying, from the population of 
candidate second ligands, a second ligand that transfers 

15 magnetization to the first ligand, thereby determining 
that the two ligands are proximal to each other in a 
ternary bound complex with the protein; 

(d) observing competitive binding between one 
of the two ligands and the common ligand, thereby 

20 determining that the competitive binding ligand binds to 
the common ligand binding site of the protein; and 

(e) obtaining a population of candidate 
binding compounds comprising the competitive binding 
ligand, or a fragment thereof, linked to one of a 

25 plurality of homologs of the other ligand, 

whereby the population of candidate binding 
compounds contains binding compounds that bind to members 
of the protein family. 



23. The method of claim 22, wherein the 
30 population of candidate second ligands comprise a MOTIF 
library. 
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24. The method of claim 22, wherein the 
protein contains a natural abundance of isotopes for 
carbon or nitrogen. 

25. The method of claim 22, wherein the 

5 protein is present in the sample at a concentration less 
than 200 micromolar. 

26. The method of claim 22, wherein the 
protein has a monomeric molecular weight greater than 30 
kDa . 

10 27. The method of claim 22, wherein the 

protein is deuterium labeled. 

28. The method of claim 22, wherein the common 
ligand is a natural common ligand. 

29. The method of claim 22, wherein the common 
15 ligand is a mimic of a natural common ligand. 

30. The method of claim 22 wherein the 
equilibrium dissociation constant for the first ligand 
and the protein absent the second ligand, but otherwise 
under the conditions, is greater than 1 x 10~ 6 Molar. 

20 31. The method of claim 30, wherein the 

equilibrium dissociation constant for the second ligand 
and the macromolecule absent the first ligand, but 
otherwise under the conditions, is greater than 1 x 10" 6 
Molar. 
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32. The method of claim 22, wherein the first 
ligand is identified by a method comprising measuring 
cross-saturation for a bound complex comprising the first 
ligand bound to the macromolecule. 

5 33. The method of claim 32, wherein the cross- 

saturation is measured using WaterLOGSY. 

34. The method of claim 22, wherein the first 
ligand is identified by obtaining a structure model for 
the binding site of the macromolecule and docking a 

10 structure model of the first ligand with the structure 
model of the binding site. 

35. The method of claim 22, wherein the first 
ligand is identified by a binding assay. 

36. The method of claim 22, further comprising 
15 determining that the competitive binding ligand binds to 

a different location on the protein from the other 
ligand. 



37. The method of claim 22, wherein step (c) 
further comprises measuring magnetization transfer at a 

20 temperature below 10° C. 

38. The method of claim 37, wherein measuring 
magnetization transfer comprises obtaining a 2D OH/H) 
NOESY spectrum. 



39. The method of claim 22, wherein step (c) 
25 further comprises identifying an atom of the first ligand 
that is proximal to an atom of the second ligand. 
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40. The method of claim 39, further comprising 
determining the distance between the atom of the first 
ligand that is proximal to the atom of the second ligand. 

41. The method of claim 40, wherein the linker 
5 connects the first ligand, or fragment thereof, and the 

second ligand homolog such that the first ligand, or 
fragment thereof can be separated from the second ligand 
homolog by the distance separating them in the bound 
complex. 

10 42. The method of claim 39, wherein the linker 

connects the first ligand and the second ligand homolog 
at the positions of the proximal atoms of the first and 
second ligands. 

43. The method of claim 22, further comprising 
15 repeating steps (a) through (e) , wherein the first ligand 

is replaced by the binding compound obtained in step (e) 
and the second ligand is replaced by another ligand. 

44. A method of identifying a compound that 
binds a protein, comprising the steps of: 

20 (a) obtaining a focused library of candidate 

binding compounds for a protein family according to claim 
22; 

(b) assaying the focused library for the 
ability to bind to a protein, wherein the protein is a 

25 member of the protein family; and 

(c) identifying a member of the focused library 
that binds to the protein. 
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45. The method of claim 22, wherein the first 
ligand is identified by assaying a population of 
candidate first ligands for the ability to bind the 
protein and identifying from the population of candidate 

5 first ligands a first ligand that binds to the protein. 

46. The method of claim 22, wherein the 
population of candidate first ligands comprise a MOTIF 
library. 
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47. A method for obtaining a binding compound 
having specificity for a member of a protein family, 
wherein the members of the protein family bind a common 
ligand, comprising the steps of: 
5 (a) observing competitive binding of the 

common ligand and a first ligand to a first protein, 

(b) observing competitive binding of the 
common ligand and a first ligand to a second protein, 

wherein the first and second proteins are 
10 members of the protein family, thereby determining that 
the first ligand binds to the common ligand binding site 
of the first and second proteins; 

(c) providing a sample comprising the first 
protein, the first ligand and a second ligand; 

15 (d) providing a sample comprising the second 

protein, the first ligand and the second ligand; 

(e) comparing the degree of magnetization 

transfer between the first ligand and the second ligand 

for the samples of parts (b) and (c) ; and 
20 (f) obtaining a binding compound comprising 

the first ligand, or a fragment thereof, linked to the 

second ligand, or a fragment thereof, 

whereby the binding compound selectively binds 

the first protein compared to the second protein. 

25 48. The method of claim 47, wherein the 

proteins contain a natural abundance of isotopes for 
carbon or nitrogen. 



49. The method of claim 47, wherein the 
proteins are present in the samples at a concentration 
30 less than 200 micromolar. 
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50. The method of claim 47 wherein the 
proteins have a monomeric molecular weight greater than 
30 kDa. 

51. The method of claim 47, wherein the 
5 proteins are deuterium labeled. 

52. The method of claim 47, wherein the 
binding compound has at least 2 fold higher affinity for 
the first protein compared to the second protein. 

53. The method of claim 47 , wherein the degree 
10 of magnetization transfer is determined by measuring 

cros s - satura t i on . 

54. The method of claim 53, wherein the cross- 
saturation is measured using WaterLOGSY. 

55. The method of claim 47, wherein step (e) 
15 further comprises measuring magnetization transfer at a 

temperature below 10° C. 

56. The method of claim 55, wherein measuring 
magnetization transfer comprises obtaining a 2D ^H, 1 !!) 
NOESY spectrum. 

20 57. The method of claim 47, wherein step (e) 

further comprises identifying an atom of the first ligand 
that is proximal to an atom of the second ligand. 

58. The method of claim 57, further comprising 
determining the distance between the atom of the first 
25 ligand that is proximal to the atom of the second ligand. 



WO 03/023392 



PCT7US02/28745 



104 

59. The method of claim 58 , wherein the linker 
connects the first ligand, or fragment thereof, and the 
second ligand homolog such that the first ligand, or 
fragment thereof can be separated from the second ligand 

5 homolog by the distance separating them in the bound 
complex. 

60. The method of claim 58, wherein the first 
ligand, or fragment thereof, and the second ligand, or 
fragment thereof, are linked at the positions of the 

10 proximal atoms of the first and second ligands. 

61. The method of claim 47, wherein the first 
ligand comprises a ligand-probe having an antenna moiety. 

62. The method of claim 61, wherein step (e) 
comprises comparing the degree of magnetization transfer 

15 between the antenna moiety of the ligand-probe and the 
second ligand for the samples of parts (b) and (c) . 

63. The method of claim 61, wherein the ligand- 
probe comprises a common ligand attached to the antenna 
moiety. 
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64. A method for obtaining a focused library 
of candidate binding compounds for a protein family, 
wherein the members of the protein family bind a common 
ligand, comprising the steps of: 
5 (a) providing a ligand-probe having an antenna 

moiety, wherein the ligand-probe binds to the common 
ligand binding site of a protein, wherein the protein is 
a member of the protein family; 

(b) providing a sample comprising the protein, 
10 the ligand-probe and a second ligand under conditions 

wherein the ligand-probe, the second ligand and the 
protein form a bound complex; 

(c) detecting magnetization transfer between 
the antenna moiety of the ligand-probe and the second 

15 ligand in the bound complex, thereby determining that the 
antenna moiety and second ligand are proximal in the 
bound complex; and 

(d) obtaining a population of candidate 
binding compounds comprising the ligand-probe, or a 

20 fragment thereof, linked to one of a plurality of second 
ligand homologs, 

whereby the population contains binding 
compounds that bind to members of the protein family. 



65. The method of claim 64, wherein the ligand- 
25 probe comprises a common ligand attached to the antenna' 

moiety. 

66. The method of claim 65, wherein step (d) 
comprises obtaining a population of candidate binding 
compounds comprising the common' ligand, or a fragment 

30 thereof, linked to one of a plurality of second ligand 
homologs . 
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67. The method of claim 65 , wherein candidate 
binding compounds in the population of step (d) have a 
linkage between the antenna moiety and a second ligand 
homolog. 

5 68. The method of claim 65, wherein the common 

ligand is a natural common ligand. 

69. The method of claim 65, wherein the common 
ligand is a mimic of a natural common ligand. 

70. The method of claim 64, wherein the ligand 
probe has a plurality of antenna moieties. 

71. The method of claim 70, wherein step (d) _ 
comprises detecting magnetization transfer between the 
antenna moieties of the ligand-probe and the second 
ligand in the bound complex, thereby determining that the 
antenna moieties and second ligand are proximal in the 
bound complex. 

72. The method of claim 64, further comprising 
a step of observing competitive binding of a common 
ligand and the ligand-probe to the protein, thereby 
determining that the ligand-probe binds to the common 
ligand binding site of the protein. 

73. The method of claim 72, wherein binding of 
the ligand-probe is identified by a method comprising 
measuring cross -saturation for a bound complex comprising 
the first ligand bound to the macromolecule. 
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74. The method of claim 73, wherein the cross- 
saturation is measured using WaterLOGSY. 

75. The method of claim 64, wherein the 
protein contains a natural abundance of isotopes for 

5 carbon or nitrogen. 

76. The method of claim 64, wherein the 
protein is present in the sample at a concentration less 
than 200 micromolar. 

77. The method of claim 64, wherein the 

10 protein has a monomeric molecular weight greater than 3 0 
kDa. 

78. The method of claim 64, wherein the 
protein is deuterium labeled. 

79. The method of claim 64, wherein the ligand- 
15 probe is identified based on visual inspection of a 

structure model for the binding site of the 
macromolecule . 

80. The method of claim 64, wherein step (c) 
further comprises measuring magnetization transfer at a 

20 temperature below 10° C. 

81. The method of claim 80, wherein measuring 
magnetization transfer comprises obtaining a 2D ^H/H) 
NOESY spectrum. 
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82. The method of claim 64, wherein step (c) 
further comprises identifying an atom of the antenna 
moiety that is proximal to an atom of the second ligand. 

83. The method of claim 82, further comprising 
5 determining the distance between the atom of the antenna 

moiety that is proximal to the atom of the second ligand. 

84. A method of identifying a compound that 
binds a protein, comprising the steps of: 

(a) obtaining a focused library of candidate 

10 binding compounds for a protein family according to claim 
64; 

(b) assaying the focused library for the 
ability to bind to a protein, wherein the protein is a 
member of the protein family; and 

15 (c) identifying a member of the focused library 

that binds to the protein. 
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85. A method for obtaining a focused library 
of candidate binding compounds, wherein the members of 
the protein family bind a common ligand, comprising the 
steps of : 

5 (a) providing a ligand-probe having an antenna 

moiety, wherein the ligand-probe binds to the common 
ligand binding site of a protein, wherein the protein is 
a member of the protein family; 

(b) providing a plurality of samples 
10 comprising the protein and the ligand-probe under 

conditions wherein the ligand-probe and the protein form 
a bound complex, wherein the protein is a member of a 
family of proteins that bind a common ligand; 

(c) assaying a population of candidate second 
15 ligands for the ability to transfer magnetization to the 

antenna moiety of the ligand-probe in a sample from the 
plurality; 

(d) identifying, from the population of 
candidate second ligands, a second ligand that transfers 

2 0 magnetization to the antenna moiety of the ligand-probe, 
thereby determining that the two ligands are proximal to 
each other in a ternary bound complex with the protein; 
and 

(e) obtaining a population of candidate 
25 binding compounds comprising the ligand-probe, or a 

fragment thereof, linked to one of a plurality of. 
homologs of the other ligand, 

whereby the population of candidate binding 
compounds contains binding compounds that bind to members 
30 of the protein family. 
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86. The method of claim 85, wherein the ligand 
probe comprises a common ligand attached to the antenna 
moiety. 

87. The method of claim 86, wherein step (e) 
5 comprises obtaining a population of candidate binding 

compounds comprising the common ligand, or a fragment 
thereof, linked to one of a plurality of second ligand 
homologs . 

88. The method of claim 86, wherein candidate 
10 binding compounds in the population of step (e) have a 

linkage between the antenna moiety and a second ligand 
homolog. 

89. The method of claim 86, wherein the common 
ligand is a natural common ligand. 

15 go. The method of claim 86, wherein the common 

ligand is a mimic of a natural common ligand. 

91. The method of claim 85, wherein the ligand 
probe has a plurality of antenna moieties. 

92. The method of claim 91, wherein step (c) 
20 comprises assaying a population of candidate second 

ligands for the ability to transfer magnetization to the 
plurality of antenna moiety of the ligand-probe in a 
sample from the plurality. 
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93. The method of claim 85, further comprising 
a step of observing competitive binding of a common 
ligand and the ligand-probe to the protein, thereby 
determining that the ligand-probe binds to the common 

5 ligand binding site of the protein. 

94. The method of claim 93, wherein binding of 
the ligand-probe is identified by a method comprising 
measuring cross -saturation for a bound complex comprising 
the first ligand bound to the macromolecule. 

10 95. The method of claim 94, wherein the cross- 

saturation is measured using WaterLOGSY. 

96. The method of claim 85, wherein the 
protein contains a natural abundance of isotopes for 
carbon or nitrogen. 

15 97. The method of claim 85, wherein the 

protein is present in the sample at a concentration less 
than 200 micromolar. 

98. The method of claim 85, wherein the 
protein has a monomeric molecular weight greater than 3 0 

20 kDa. 

99. The method of claim 85, wherein the 
protein is deuterium labeled. 

100. The method of claim 85, wherein the 
ligand-probe is identified based on visual inspection of 

25 a structure model for the binding site of the 
macromolecule. 
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101. The method of claim 85, further 
comprising determining that the second ligand binds to a 
different location on the protein from the common ligand. 

102. The method of claim 85, wherein step (c) 
5 further comprises measuring magnetization transfer at a 

temperature below 10° C. 

103. The method of claim 102, wherein measuring 
magnetization transfer comprises obtaining a 2D ( X H # 1 H) 
NOESY spectrum. 

10 104. The method of claim 85, wherein step (d) 

further comprises identifying an atom of the antenna 
moiety that is proximal to an atom of the second ligand. 

105. The method of claim 104, further 
comprising determining the distance between the atom of 

15 the antenna moiety that is proximal to the atom of the 
second ligand. 

106. A method of identifying a compound that 
binds a protein, comprising the steps of: 

(a) obtaining a focused library of candidate 

20 binding compounds for a protein family according to claim 
85; 

(b) assaying the focused library for the 
ability to bind to a protein, wherein the protein is a 
member of the protein family; and 

25 (c) identifying a member of the focused library 

that binds to the protein. 
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107. The method of claim 85, wherein the 
ligand-probe is identified by assaying a population of 
candidate ligand-probes for the ability to bind the 
protein and identifying from the population a ligand- 
5 probe that binds to the protein. 
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